BASB027 PROTEINS AND GENES FROM MORAXELLA CATARRHALIS, ANTIGENS, ANTIBODIES, AND USES



FIELD OF THE INVENTION 

This invention relates to polynucleotides (herein referred to as "BASB027
polynucleotide(s)"), polypeptides encoded by them (referred to herein as "BASB027" or
"BASB027 polypeptide(s)"), recombinant materials and methods for their production. In
another aspect, the invention relates to methods for using such polypeptides and 
polynucleotides, including vaccines against bacterial infections. In a further aspect, the 
invention relates to diagnostic assays for detecting infection of certain pathogens. 

BACKGROUND OF THE INVENTION 

Moraxella catarrhalis (also named Branhamella catarrhalis) is a Gram-negative bacterium
frequently isolated from the human upper respiratory tract. It is responsible for several
pathologies, the main ones being otitis media in infants and children, and pneumonia in
the elderly. It is also responsible for sinusitis, nosocomial infections and, less frequently,
invasive diseases.



Otitis media is an important childhood disease both by the number of cases and its potential
sequelae. More than 3.5 million cases are recorded every year in the United States, and it is
estimated that 80 % of children have experienced at least one episode of otitis before
reaching the age of 3 (Klein, JO (1994) Clin.Inf.Dis 19:823). Left untreated, or becoming
chronic, this disease may lead to hearing losses that could be temporary (in the case of fluid
accumulation in the middle ear) or permanent (if the auditory nerve is damaged). In infants,
such hearing losses may be responsible for delayed speech learning.

Three bacterial species are primarily isolated from the middle ear of children with otitis
media: Streptococcus pneumoniae, non-typeable Haemophilus influenzae (NTHi) and M.
catarrhalis. They are present in 60 to 90 % of the cases. A review of recent studies shows
that S. pneumoniae and NTHi each represent about 30 %, and M. catarrhalis about 15 % of
the otitis media cases (Murphy, TF (1996) Microbiol. Rev. 60:267). Other bacteria could be
isolated from the middle ear (H. influenzae type B, S. pyogenes etc.) but at a much lower
frequency (2 % of the cases or less).

Epidemiological data indicate that, for the pathogens found in the middle ear, the
colonization of the upper respiratory tract is an absolute prerequisite for the development of
otitis; other factors are, however, also required to lead to the disease (Dickinson, DP et al. (1988)
J. Infect.Dis. 158:205, Faden, HL et al. (1991) Ann.Otorhinol.Laryngol. 100:612). These are
important to trigger the migration of the bacteria into the middle ear via the Eustachian
tubes, followed by the initiation of an inflammatory process. These factors are unknown
to date. It has been postulated that a transient anomaly of the immune system following a
viral infection, for example, could cause an inability to control the colonization of the
respiratory tract (Faden, HL et al. (1994) J. Infect.Dis. 169:1312). An alternative explanation
is that exposure to environmental factors allows a heavier colonization of some
children, who subsequently become susceptible to the development of otitis media because 
of the sustained presence of middle ear pathogens (Murphy, TF (1996) Microbiol.Rev. 
60:267). 

The immune response to M. catarrhalis is poorly characterized. The analysis of strains
isolated sequentially from the nasopharynx of babies followed from 0 to 2 years of age
indicates that they frequently acquire and eliminate new strains. This indicates that an
efficacious immune response against this bacterium is mounted by the colonized children
(Faden, HL et al. (1994) J. Infect.Dis. 169:1312).

In most adults tested, bactericidal antibodies have been identified (Chapman, AJ et al.
(1985) J. Infect.Dis. 151:878). Strains of M. catarrhalis present variations in their capacity
to resist serum bactericidal activity: in general, isolates from diseased individuals are more
resistant than those from individuals who are simply colonized (Hol, C et al. (1993) Lancet
341:1281, Jordan, KL et al. (1990) Am. J.Med. 88 (suppl. 5A):28S). Serum resistance could
therefore be considered as a virulence factor of the bacteria. An opsonizing activity has been
observed in the sera of children recovering from otitis media.

WO 99/63093 PCT/EP99/03822

The antigens targeted by these different immune responses in humans have not been
identified, with the exception of OMP B1, an 84 kDa protein whose expression is regulated
by iron, and that is recognized by the sera of patients with pneumonia (Sethi, S. et al. (1995)
Infect.Immun. 63:1516), and of UspA1 and UspA2 (Chen, D. et al. (1999) Infect.Immun.
67:1310).

A few other membrane proteins present on the surface of M. catarrhalis have been
characterized using biochemical methods, or for their potential implication in the induction of
protective immunity (for review, see Murphy, TF (1996) Microbiol. Rev. 60:267). In a
mouse pneumonia model, the presence of antibodies raised against some of them (UspA,
CopB) favors a faster clearance of the pulmonary infection. Another polypeptide (OMP CD)
is highly conserved among M. catarrhalis strains, and presents homologies with a porin of
Pseudomonas aeruginosa, which has been demonstrated to be efficacious against this bacterium
in animal models.

The frequency of Moraxella catarrhalis infections has risen dramatically in the past few
decades. This has been attributed to the emergence of multiply antibiotic resistant strains
and an increasing population of people with weakened immune systems. It is no longer 
uncommon to isolate Moraxella catarrhalis strains that are resistant to some or all of the 
standard antibiotics. This phenomenon has created an unmet medical need and demand for 
new anti-microbial agents, vaccines, drug screening methods, and diagnostic tests for this 
organism.

The present invention relates to BASB027, in particular BASB027 polypeptides and
BASB027 polynucleotides, recombinant materials and methods for their production. In
another aspect, the invention relates to methods for using such polypeptides and 
polynucleotides, including prevention and treatment of microbial diseases, amongst others. 
In a further aspect, the invention relates to diagnostic assays for detecting diseases 
associated with microbial infections and conditions associated with such infections, such 
as assays for detecting expression or activity of BASB027 polynucleotides or 
polypeptides. 

Various changes and modifications within the spirit and scope of the disclosed invention 
will become readily apparent to those skilled in the art from reading the following 
descriptions and from reading the other parts of the present disclosure.

DESCRIPTION OF THE INVENTION 

The invention relates to BASB027 polypeptides and polynucleotides as described in greater 
detail below. In particular, the invention relates to polypeptides and polynucleotides of 
BASB027 of Moraxella catarrhalis, which is related by amino acid sequence homology to 
Neisseria meningitidis OMP85 outer membrane protein. The invention relates especially to 
BASB027 having the nucleotide and amino acid sequences set out in SEQ ID NO:1 or 3 and
SEQ ID NO:2 or 4 respectively. It is understood that sequences recited in the Sequence
Listing below as "DNA" represent an exemplification of one embodiment of the
invention, since those of ordinary skill will recognize that such sequences can be usefully 
employed in polynucleotides in general, including ribopolynucleotides. 

Polypeptides 

In one aspect of the invention there are provided polypeptides of Moraxella catarrhalis 
referred to herein as "BASB027" and "BASB027 polypeptides" as well as biologically, 
diagnostically, prophylactically, clinically or therapeutically useful variants thereof, and 
compositions comprising the same.

The present invention further provides for:

(a) an isolated polypeptide which comprises an amino acid sequence which has at least 
85% identity, preferably at least 90% identity, more preferably at least 95% identity, most 
preferably at least 97-99% or exact identity, to that of SEQ ID NO:2 or 4; 

(b) a polypeptide encoded by an isolated polynucleotide comprising a polynucleotide 
sequence which has at least 85% identity, preferably at least 90% identity, more 
preferably at least 95% identity, even more preferably at least 97-99% or exact identity to 
SEQ ID NO:1 or 3 over the entire length of SEQ ID NO:1 or 3 respectively; or

(c) a polypeptide encoded by an isolated polynucleotide comprising a polynucleotide 
sequence encoding a polypeptide which has at least 85% identity, preferably at least 90% 
identity, more preferably at least 95% identity, even more preferably at least 97-99% or 
exact identity, to the amino acid sequence of SEQ ID NO:2 or 4. 

The BASB027 polypeptides provided in SEQ ID NO:2 or 4 are the BASB027 
polypeptides from Moraxella catarrhalis strain Mc2931 (ATCC 43617).

The invention also provides an immunogenic fragment of a BASB027 polypeptide, that 
is, a contiguous portion of the BASB027 polypeptide which has the same or substantially 
the same immunogenic activity as the polypeptide comprising the amino acid sequence of 
SEQ ID NO:2 or 4. That is to say, the fragment (if necessary when coupled to a carrier) is
capable of raising an immune response which recognises the BASB027 polypeptide. 
Such an immunogenic fragment may include, for example, the BASB027 polypeptide 
lacking an N-terminal leader sequence, and/or a transmembrane domain and/or a C- 
terminal anchor domain. In a preferred aspect the immunogenic fragment of BASB027 
according to the invention comprises substantially all of the extracellular domain of a 
polypeptide which has at least 85% identity, preferably at least 90% identity, more 
preferably at least 95% identity, most preferably at least 97-99% identity, to that of SEQ 
ID NO:2 or 4 over the entire length of SEQ ID NO:2.

A fragment is a polypeptide having an amino acid sequence that is entirely the same as part
but not all of any amino acid sequence of any polypeptide of the invention. As with 
BASB027 polypeptides, fragments may be "free-standing," or comprised within a larger 
polypeptide of which they form a part or region, most preferably as a single continuous 
region in a single larger polypeptide. 

Preferred fragments include, for example, truncation polypeptides having a portion of an 
amino acid sequence of SEQ ID NO:2 or 4 or of variants thereof such as a continuous series 
of residues that includes an amino- and/or carboxyl-terminal amino acid sequence. 
Degradation forms of the polypeptides of the invention produced by or in a host cell, are 
also preferred. Further preferred are fragments characterized by structural or functional 
attributes such as fragments that comprise alpha-helix and alpha-helix forming regions, 
beta-sheet and beta-sheet-forming regions, turn and turn-forming regions, coil and coil- 
forming regions, hydrophilic regions, hydrophobic regions, alpha amphipathic regions, beta 
amphipathic regions, flexible regions, surface-forming regions, substrate binding region, and 
high antigenic index regions.

Further preferred fragments include an isolated polypeptide comprising an amino acid 
sequence having at least 15, 20, 30, 40, 50 or 100 contiguous amino acids from the 
amino acid sequence of SEQ ID NO: 2 or 4, or an isolated polypeptide comprising an 
amino acid sequence having at least 15, 20, 30, 40, 50 or 100 contiguous amino acids 
truncated or deleted from the amino acid sequence of SEQ ID NO: 2 or 4. 

Fragments of the polypeptides of the invention may be employed for producing the 
corresponding full-length polypeptide by peptide synthesis; therefore, these fragments 
may be employed as intermediates for producing the full-length polypeptides of the 
invention.

Particularly preferred are variants in which several, 5-10, 1-5, 1-3, 1-2 or 1 amino acids
are substituted, deleted, or added in any combination. 

The polypeptides, or immunogenic fragments, of the invention may be in the form of 
the "mature" protein or may be a part of a larger protein such as a precursor or a fusion 
protein. It is often advantageous to include an additional amino acid sequence which 
contains secretory or leader sequences, pro-sequences, sequences which aid in 
purification such as multiple histidine residues, or an additional sequence for stability 
during recombinant production. Furthermore, addition of exogenous polypeptide or 
lipid tail or polynucleotide sequences to increase the immunogenic potential of the final 
molecule is also considered. 

In one aspect, the invention relates to genetically engineered soluble fusion proteins 
comprising a polypeptide of the present invention, or a fragment thereof, and various 
portions of the constant regions of heavy or light chains of immunoglobulins of various 
subclasses (IgG, IgM, IgA, IgE). Preferred as an immunoglobulin is the constant part of 
the heavy chain of human IgG, particularly IgG1, where fusion takes place at the hinge
region. In a particular embodiment, the Fc part can be removed simply by incorporation 
of a cleavage sequence which can be cleaved with blood clotting factor Xa. 

Furthermore, this invention relates to processes for the preparation of these fusion 
proteins by genetic engineering, and to the use thereof for drug screening, diagnosis and 
therapy. A further aspect of the invention also relates to polynucleotides encoding such 
fusion proteins. Examples of fusion protein technology can be found in International 
Patent Application Nos. WO94/29458 and WO94/22914.

The proteins may be chemically conjugated, or expressed as recombinant fusion 
proteins allowing increased levels to be produced in an expression system as compared 
to non-fused protein. The fusion partner may assist in providing T helper epitopes
(immunological fusion partner), preferably T helper epitopes recognised by humans, or
assist in expressing the protein (expression enhancer) at higher yields than the native
recombinant protein. Preferably the fusion partner will be both an immunological 
fusion partner and expression enhancing partner. 

Fusion partners include protein D from Haemophilus influenzae, and the non-structural
protein from influenza virus, NS1 (hemagglutinin). Another fusion partner is the
protein known as LytA. Preferably the C terminal portion of the molecule is used. LytA
is derived from Streptococcus pneumoniae, which synthesizes an N-acetyl-L-alanine
amidase, amidase LytA (coded by the lytA gene {Gene, 43 (1986) page 265-272}), an
autolysin that specifically degrades certain bonds in the peptidoglycan backbone. The
C-terminal domain of the LytA protein is responsible for the affinity to choline or to
some choline analogues such as DEAE. This property has been exploited for the
development of E.coli C-LytA expressing plasmids useful for expression of fusion
proteins. Purification of hybrid proteins containing the C-LytA fragment at its amino
terminus has been described {Biotechnology: 10, (1992) page 795-798}. It is possible
to use the repeat portion of the LytA molecule found in the C terminal end starting at
residue 178, for example residues 188 - 305.

The present invention also includes variants of the aforementioned polypeptides, that is 
polypeptides that vary from the referents by conservative amino acid substitutions, 
whereby a residue is substituted by another with like characteristics. Typical such 
substitutions are among Ala, Val, Leu and Ile; among Ser and Thr; among the acidic
residues Asp and Glu; among Asn and Gln; among the basic residues Lys and Arg; and among
the aromatic residues Phe and Tyr.
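
The substitution groups listed above can be expressed as a small lookup. The following is a minimal sketch only, not part of the disclosed method: the function names are hypothetical, residues are written in one-letter code, and the two sequences are assumed to be of equal length (insertions and deletions are not handled).

```python
# Conservative substitution groups as listed in the text (one-letter codes).
CONSERVATIVE_GROUPS = [
    set("AVLI"),  # Ala, Val, Leu, Ile
    set("ST"),    # Ser, Thr
    set("DE"),    # acidic residues: Asp, Glu
    set("NQ"),    # Asn, Gln
    set("KR"),    # basic residues: Lys, Arg
    set("FY"),    # aromatic residues: Phe, Tyr
]


def is_conservative(a: str, b: str) -> bool:
    """True if residues a and b differ and belong to the same group."""
    a, b = a.upper(), b.upper()
    return a != b and any(a in g and b in g for g in CONSERVATIVE_GROUPS)


def classify_substitutions(reference: str, variant: str):
    """Compare two equal-length sequences position by position.

    Returns (conservative, other), each a list of (position, ref, var)
    tuples with 1-based positions. Hypothetical helper for illustration.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must have equal length (no indels handled)")
    conservative, other = [], []
    pairs = zip(reference.upper(), variant.upper())
    for pos, (r, v) in enumerate(pairs, start=1):
        if r == v:
            continue
        target = conservative if is_conservative(r, v) else other
        target.append((pos, r, v))
    return conservative, other
```

For instance, comparing the made-up peptides "MKALST" and "MRAIDT" reports Lys to Arg and Leu to Ile as conservative changes and Ser to Asp as non-conservative.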

Polypeptides of the present invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a
combination of these methods. Means for preparing such polypeptides are well
understood in the art. 

It is most preferred that a polypeptide of the invention is derived from Moraxella
catarrhalis; however, it may preferably be obtained from other organisms of the same
taxonomic genus. A polypeptide of the invention may also be obtained, for example, from 
organisms of the same taxonomic family or order. 

Polynucleotides 

It is an object of the invention to provide polynucleotides that encode BASB027 
polypeptides, particularly polynucleotides that encode the polypeptide herein designated 
BASB027. 

In a particularly preferred embodiment of the invention the polynucleotide comprises a 
region encoding BASB027 polypeptides comprising a sequence set out in SEQ ID NO:1 or
3 which includes a full length gene, or a variant thereof.

The BASB027 polynucleotides provided in SEQ ID NO:1 or 3 are the BASB027
polynucleotides from Moraxella catarrhalis strain Mc2931 (ATCC 43617). 

As a further aspect of the invention there are provided isolated nucleic acid molecules 
encoding and/or expressing BASB027 polypeptides and polynucleotides, particularly 
Moraxella catarrhalis BASB027 polypeptides and polynucleotides, including, for 
example, unprocessed RNAs, ribozyme RNAs, mRNAs, cDNAs, genomic DNAs, B- 
and Z-DNAs. Further embodiments of the invention include biologically, 
diagnostically, prophylactically, clinically or therapeutically useful polynucleotides and 
polypeptides, and variants thereof, and compositions comprising the same.

Another aspect of the invention relates to isolated polynucleotides, including at least one full
length gene, that encodes a BASB027 polypeptide having a deduced amino acid sequence of 
SEQ ID NO:2 or 4 and polynucleotides closely related thereto and variants thereof. 

In another particularly preferred embodiment of the invention there is a BASB027 
polypeptide from Moraxella catarrhalis comprising or consisting of an amino acid
sequence of SEQ ID NO:2 or 4 or a variant thereof. 

Using the information provided herein, such as a polynucleotide sequence set out in SEQ ID
NO:1 or 3, a polynucleotide of the invention encoding BASB027 polypeptide may be
obtained using standard cloning and screening methods, such as those for cloning and
sequencing chromosomal DNA fragments from bacteria using Moraxella catarrhalis Catlin
cells as starting material, followed by obtaining a full length clone. For example, to obtain a
polynucleotide sequence of the invention, such as a polynucleotide sequence given in
SEQ ID NO:1 or 3, typically a library of clones of chromosomal DNA of Moraxella
catarrhalis Catlin in E.coli or some other suitable host is probed with a radiolabeled 
oligonucleotide, preferably a 17-mer or longer, derived from a partial sequence. Clones 
carrying DNA identical to that of the probe can then be distinguished using stringent 
hybridization conditions. By sequencing the individual clones thus identified by 
hybridization with sequencing primers designed from the original polypeptide or 
polynucleotide sequence it is then possible to extend the polynucleotide sequence in both 
directions to determine a full length gene sequence. Conveniently, such sequencing is 
performed, for example, using denatured double stranded DNA prepared from a plasmid 
clone. Suitable techniques are described by Maniatis, T., Fritsch, E.F. and Sambrook et
al., MOLECULAR CLONING, A LABORATORY MANUAL, 2nd Ed.; Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, New York (1989) (see in particular Screening By
Hybridization 1.90 and Sequencing Denatured Double-Stranded DNA Templates 13.70).
Direct genomic DNA sequencing may also be performed to obtain a full length gene
sequence. Illustrative of the invention, each polynucleotide set out in SEQ ID NO:1 or 3
was discovered in a DNA library derived from Moraxella catarrhalis.

Moreover, each DNA sequence set out in SEQ ID NO: 1 or 3 contains an open reading frame 
encoding a protein having about the number of amino acid residues set forth in SEQ ID 
NO:2 or 4 with a deduced molecular weight that can be calculated using amino acid residue 
molecular weight values well known to those skilled in the art. 

The polynucleotide of SEQ ID NO:1, between the start codon at nucleotide number 1 and
the stop codon which begins at nucleotide number 2440 of SEQ ID NO:1, encodes the
polypeptide of SEQ ID NO:2.

The polynucleotide of SEQ ID NO:3, between the start codon at nucleotide number 1 and
the stop codon which begins at nucleotide number 2440 of SEQ ID NO:3, encodes the
polypeptide of SEQ ID NO:4.
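
The coordinates quoted above imply a coding region whose length can be checked by simple arithmetic. The sketch below is illustrative only: the function name is hypothetical, positions are 1-based as in the text, and the sequence listing itself is not reproduced here, so only the stated coordinates are used.

```python
def coding_region(start_codon_pos: int, stop_codon_pos: int):
    """Return (first_nt, last_nt, n_codons) for an open reading frame.

    start_codon_pos: 1-based position of the first nucleotide of the start codon
    stop_codon_pos:  1-based position of the first nucleotide of the stop codon
    The stop codon itself is excluded from the coding region.
    """
    length = stop_codon_pos - start_codon_pos
    if length <= 0 or length % 3 != 0:
        raise ValueError("stop codon is not in frame with the start codon")
    return start_codon_pos, stop_codon_pos - 1, length // 3


first, last, codons = coding_region(1, 2440)
print(first, last, codons)  # 1 2439 813
```

With a start codon at nucleotide 1 and a stop codon beginning at nucleotide 2440, the coding region spans nucleotides 1 to 2439, that is 2439 / 3 = 813 codons.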

In a further aspect, the present invention provides for an isolated polynucleotide
comprising or consisting of:

(a) a polynucleotide sequence which has at least 85% identity, preferably at least 90% 
identity, more preferably at least 95% identity, even more preferably at least 97-99% or 
exact identity to SEQ ID NO: 1 or 3 over the entire length of SEQ ID NO: 1 or 3 
respectively; or 

(b) a polynucleotide sequence encoding a polypeptide which has at least 85% identity, 
preferably at least 90% identity, more preferably at least 95% identity, even more 
preferably at least 97-99% or 100% exact identity, to the amino acid sequence of SEQ ID NO:2
or 4, over the entire length of SEQ ID NO:2 or 4 respectively.
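
As an illustration of how a percentage identity "over the entire length" of a reference sequence might be computed, the following sketch counts identical positions in a pre-computed alignment and divides by the number of reference residues. This is an assumption made for illustration: the alignment method, the gap symbol '-' and both function names are hypothetical and are not taken from the specification.

```python
def percent_identity(aligned_ref: str, aligned_query: str) -> float:
    """Percent identity over the entire length of the reference.

    Both inputs must be pre-aligned, equal-length strings using '-' for gaps.
    Identical positions are divided by the number of reference residues, so
    gap columns in the reference do not enter the denominator.
    """
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")
    ref_len = sum(1 for r in aligned_ref if r != "-")
    if ref_len == 0:
        raise ValueError("reference contains no residues")
    matches = sum(
        1
        for r, q in zip(aligned_ref.upper(), aligned_query.upper())
        if r != "-" and r == q
    )
    return 100.0 * matches / ref_len


def identity_tier(pct: float):
    """Highest threshold named in the text (85, 90, 95, 97) that pct meets."""
    for threshold in (97, 95, 90, 85):
        if pct >= threshold:
            return threshold
    return None
```

A 20-residue reference with one mismatch gives 95 % and falls in the "at least 95%" tier; four mismatches give 80 %, which is below the lowest threshold recited.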

A polynucleotide encoding a polypeptide of the present invention, including homologs and 
orthologs from species other than Moraxella catarrhalis, may be obtained by a process 
which comprises the steps of screening an appropriate library under stringent hybridization
conditions (for example, using a temperature in the range of 45 - 65°C and an SDS
concentration from 0.1 - 1 %) with a labeled or detectable probe consisting of or comprising
the sequence of SEQ ID NO:1 or 3 or a fragment thereof; and isolating a full-length gene
and/or genomic clones containing said polynucleotide sequence. 

The invention provides a polynucleotide sequence identical over its entire length to a coding 
sequence (open reading frame) in SEQ ID NO:1 or 3. Also provided by the invention is a
coding sequence for a mature polypeptide or a fragment thereof, by itself as well as a coding 
sequence for a mature polypeptide or a fragment in reading frame with another coding 
sequence, such as a sequence encoding a leader or secretory sequence, a pre-, or pro- or 
prepro-protein sequence. The polynucleotide of the invention may also contain at least one 
non-coding sequence, including for example, but not limited to at least one non-coding 5'
and 3' sequence, such as the transcribed but non-translated sequences, termination signals
(such as rho-dependent and rho-independent termination signals), ribosome binding sites,
Kozak sequences, sequences that stabilize mRNA, introns, and polyadenylation signals.
The polynucleotide sequence may also comprise additional coding sequence encoding 
additional amino acids. For example, a marker sequence that facilitates purification of the 
fused polypeptide can be encoded. In certain embodiments of the invention, the marker 
sequence is a hexa-histidine peptide, as provided in the pQE vector (Qiagen, Inc.) and 
described in Gentz et al., Proc. Natl. Acad. Sci., USA 86: 821-824 (1989), or an HA peptide
tag (Wilson et al., Cell 37: 767 (1984)), both of which may be useful in purifying
polypeptide sequence fused to them. Polynucleotides of the invention also include, but are
not limited to, polynucleotides comprising a structural gene and its naturally associated
sequences that control gene expression. 

The nucleotide sequence encoding BASB027 polypeptide of SEQ ID NO:2 or 4 may be 
identical to the polypeptide encoding sequence contained in nucleotides 1 to 2439 of SEQ 
ID NO: 1 or 3 respectively. Alternatively it may be a sequence, which as a result of the 
redundancy (degeneracy) of the genetic code, also encodes the polypeptide of SEQ ID
NO:2 or 4.
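
The degeneracy referred to above can be illustrated with a short translation sketch. It assumes the standard genetic code and uses made-up nine-nucleotide sequences rather than any sequence from the Sequence Listing; the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order; '*' marks stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence (length a multiple of 3) into one-letter code."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def encode_same_polypeptide(dna_a: str, dna_b: str) -> bool:
    """True if two coding sequences translate to the same amino acid sequence."""
    return translate(dna_a) == translate(dna_b)


print(translate("ATGGCTAAA"), translate("ATGGCCAAG"))  # MAK MAK
```

The two example sequences differ at two of nine nucleotides yet both encode Met-Ala-Lys, which is the sense in which a non-identical polynucleotide may encode the same polypeptide.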

The term "polynucleotide encoding a polypeptide" as used herein encompasses 
polynucleotides that include a sequence encoding a polypeptide of the invention, 
particularly a bacterial polypeptide and more particularly a polypeptide of the Moraxella
catarrhalis BASB027 having an amino acid sequence set out in SEQ ID NO:2 or 4. The
term also encompasses polynucleotides that include a single continuous region or 
discontinuous regions encoding the polypeptide (for example, polynucleotides interrupted 
by integrated phage, an integrated insertion sequence, an integrated vector sequence, an 
integrated transposon sequence, or due to RNA editing or genomic DNA reorganization) 
together with additional regions, that also may contain coding and/or non-coding sequences. 

The invention further relates to variants of the polynucleotides described herein that encode 
variants of a polypeptide having a deduced amino acid sequence of SEQ ID NO:2 or 4. 
Fragments of polynucleotides of the invention may be used, for example, to synthesize full- 
length polynucleotides of the invention. 

Further particularly preferred embodiments are polynucleotides encoding BASB027 
variants, that have the amino acid sequence of BASB027 polypeptide of SEQ ID NO:2 or 4 
in which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues are substituted,
modified, deleted and/or added, in any combination. Especially preferred among these are 
silent substitutions, additions and deletions, that do not alter the properties and activities of 
BASB027 polypeptide. 

Further preferred embodiments of the invention are polynucleotides that are at least 85% 
identical over their entire length to a polynucleotide encoding BASB027 polypeptide having 
an amino acid sequence set out in SEQ ID NO:2 or 4, and polynucleotides that are 
complementary to such polynucleotides. Alternatively, most highly preferred are 
polynucleotides that comprise a region that is at least 90% identical over its entire length to
a polynucleotide encoding BASB027 polypeptide and polynucleotides complementary
thereto. In this regard, polynucleotides at least 95% identical over their entire length to the
same are particularly preferred. Furthermore, those with at least 97% are highly preferred
among those with at least 95%, and among these those with at least 98% and at least 99%
are particularly highly preferred, with at least 99% being the more preferred. 

Preferred embodiments are polynucleotides encoding polypeptides that retain substantially 
the same biological function or activity as the mature polypeptide encoded by a DNA of 
SEQ ID NO:1 or 3.



In accordance with certain preferred embodiments of this invention there are provided 
polynucleotides that hybridize, particularly under stringent conditions, to BASB027 
polynucleotide sequences, such as those polynucleotides in SEQ ID NO:1 or 3.

The invention further relates to polynucleotides that hybridize to the polynucleotide 
sequences provided herein. In this regard, the invention especially relates to polynucleotides 
that hybridize under stringent conditions to the polynucleotides described herein. As herein 
used, the terms "stringent conditions" and "stringent hybridization conditions" mean 
hybridization occurring only if there is at least 95% and preferably at least 97% identity 
between the sequences. A specific example of stringent hybridization conditions is 
overnight incubation at 42°C in a solution comprising: 50% formamide, 5x SSC (150mM
NaCl, 15mM trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardt's
solution, 10% dextran sulfate, and 20 micrograms/ml of denatured, sheared salmon sperm
DNA, followed by washing the hybridization support in 0.1x SSC at about 65°C.
Hybridization and wash conditions are well known and exemplified in Sambrook, et al.,
Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y.,
(1989), particularly Chapter 11 therein. Solution hybridization may also be used with the
polynucleotide sequences provided by the invention. 
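
The identity-based definition of "stringent conditions" given above can be sketched as a simple check between a probe and the strand it would pair with. This is an illustration under stated assumptions, not a model of hybridization: real duplex formation also depends on temperature, salt, formamide and probe length, and the function names, the ungapped equal-length comparison and the example 20-mer are all hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def probe_target_identity(probe: str, target_strand: str) -> float:
    """Percent identity between a probe and the reverse complement of the
    strand it is meant to pair with (ungapped, equal-length comparison)."""
    partner = reverse_complement(target_strand)
    if len(partner) != len(probe):
        raise ValueError("probe and target must have equal length")
    matches = sum(p == t for p, t in zip(probe.upper(), partner))
    return 100.0 * matches / len(probe)


def meets_stringent_definition(probe: str, target_strand: str,
                               threshold: float = 95.0) -> bool:
    """True if identity reaches the threshold used in the text's definition
    (at least 95%, preferably at least 97%)."""
    return probe_target_identity(probe, target_strand) >= threshold
```

Under this reading, a 20-mer probe with a single mismatch (95 %) still meets the 95 % definition, while two mismatches (90 %) do not.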

The invention also provides a polynucleotide consisting of or comprising a polynucleotide
sequence obtained by screening an appropriate library containing the complete gene for a
polynucleotide sequence set forth in SEQ ID NO:1 or 3 under stringent hybridization
conditions with a probe having the sequence of said polynucleotide sequence set forth in
SEQ ID NO:1 or 3 or a fragment thereof; and isolating said polynucleotide sequence.
Fragments useful for obtaining such a polynucleotide include, for example, probes and 
primers fully described elsewhere herein. 

As discussed elsewhere herein regarding polynucleotide assays of the invention, for 
instance, the polynucleotides of the invention may be used as a hybridization probe for
RNA, cDNA and genomic DNA to isolate full-length cDNAs and genomic clones encoding
BASB027 and to isolate cDNA and genomic clones of other genes that have a high identity,
particularly high sequence identity, to the BASB027 gene. Such probes generally will
comprise at least 15 nucleotide residues or base pairs. Preferably, such probes will have at
least 30 nucleotide residues or base pairs and may have at least 50 nucleotide residues or
base pairs. Particularly preferred probes will have at least 20 nucleotide residues or base
pairs and will have less than 30 nucleotide residues or base pairs. 

A coding region of a BASB027 gene may be isolated by screening using a DNA sequence 
provided in SEQ ID NO:1 or 3 to synthesize an oligonucleotide probe. A labeled
oligonucleotide having a sequence complementary to that of a gene of the invention is then 
used to screen a library of cDNA, genomic DNA or mRNA to determine which members of 
the library the probe hybridizes to. 

There are several methods available and well known to those skilled in the art to obtain 
full-length DNAs, or extend short DNAs, for example those based on the method of Rapid 
Amplification of cDNA ends (RACE) (see, for example, Frohman, et al., PNAS USA 85:
8998-9002, 1988). Recent modifications of the technique, exemplified by the Marathon™
technology (Clontech Laboratories Inc.) for example, have significantly simplified the
search for longer cDNAs. In the Marathon™ technology, cDNAs have been prepared
from mRNA extracted from a chosen tissue and an 'adaptor' sequence ligated onto each 
end. Nucleic acid amplification (PCR) is then carried out to amplify the "missing" 5' end 
of the DNA using a combination of gene specific and adaptor specific oligonucleotide 
primers. The PCR reaction is then repeated using "nested" primers, that is, primers
designed to anneal within the amplified product (typically an adaptor specific primer that 
anneals further 3' in the adaptor sequence and a gene specific primer that anneals further 5' 
in the selected gene sequence). The products of this reaction can then be analyzed by 
DNA sequencing and a full-length DNA constructed either by joining the product directly 
to the existing DNA to give a complete sequence, or carrying out a separate full-length 
PCR using the new sequence information for the design of the 5' primer. 

The polynucleotides and polypeptides of the invention may be employed, for example, as 
research reagents and materials for discovery of treatments of and diagnostics for diseases, 
particularly human diseases, as further discussed herein relating to polynucleotide assays. 

The polynucleotides of the invention that are oligonucleotides derived from a sequence of 
SEQ ID NOS:1 or 3 may be used in the processes herein as described, but preferably for 
PCR, to determine whether or not the polynucleotides identified herein in whole or in part 
are transcribed in bacteria in infected tissue. It is recognized that such sequences will also 
have utility in diagnosis of the stage of infection and type of infection the pathogen has 
attained. 

The invention also provides polynucleotides that encode a polypeptide that is the mature 
protein plus additional amino or carboxyl-terminal amino acids, or amino acids interior to 
the mature polypeptide (when the mature form has more than one polypeptide chain, for 
instance). Such sequences may play a role in processing of a protein from precursor to a 
mature form, may allow protein transport, may lengthen or shorten protein half-life or may 
facilitate manipulation of a protein for assay or production, among other things. As 
generally is the case in vivo, the additional amino acids may be processed away from the 
mature protein by cellular enzymes. 

For each and every polynucleotide of the invention there is provided a polynucleotide 
complementary to it. It is preferred that these complementary polynucleotides are fully 
complementary to each polynucleotide with which they are complementary. 
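
By way of illustration only, a fully complementary strand can be derived computationally from a given sequence. The following minimal sketch assumes a plain-text DNA sequence; the function name `reverse_complement` and the sample sequence are hypothetical and are not taken from SEQ ID NO:1 or 3.

```python
# Illustrative sketch only: the function name and sample sequence are
# hypothetical and do not come from the sequence listing.

# Map each base to its Watson-Crick partner; U is treated as T's RNA
# counterpart and N (any base) pairs with N.
_COMPLEMENT = str.maketrans("ACGTUNacgtun", "TGCAANtgcaan")


def reverse_complement(seq: str) -> str:
    """Return the fully complementary DNA strand, written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATGGCCN"))
```

Applying the function twice to a DNA sequence returns the original sequence, which is a convenient check that the two strands are fully complementary.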

A precursor protein, having a mature form of the polypeptide fused to one or more 
prosequences may be an inactive form of the polypeptide. When prosequences are removed 
such inactive precursors generally are activated. Some or all of the prosequences may be 
removed before activation. Generally, such precursors are called proproteins. 

In addition to the standard A, G, C, T/U representations for nucleotides, the term "N" may 
also be used in describing certain polynucleotides of the invention. "N" means that any of 
the four DNA or RNA nucleotides may appear at such a designated position in the DNA 
or RNA sequence, except it is preferred that N is not a nucleic acid that when taken in 
combination with adjacent nucleotide positions, when read in the correct reading frame, 
would have the effect of generating a premature termination codon in such reading frame. 
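
The preference stated above can be checked mechanically. The following is a minimal sketch, assuming the standard stop codons TAA, TAG and TGA and a caller-supplied reading frame; the function name `codons_where_n_allows_stop` and the example sequences are hypothetical.

```python
from itertools import product

# Assumption: the standard stop codons are used.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons_where_n_allows_stop(seq: str, frame: int = 0) -> list:
    """Return 0-based start positions of in-frame codons in which some
    choice of base at the N position(s) would yield a stop codon."""
    seq = seq.upper().replace("U", "T")
    flagged = []
    for start in range(frame, len(seq) - 2, 3):
        codon = seq[start:start + 3]
        if "N" not in codon:
            continue
        # Expand every N to the four possible bases and test each result.
        choices = ["ACGT" if base == "N" else base for base in codon]
        if any("".join(c) in STOP_CODONS for c in product(*choices)):
            flagged.append(start)
    return flagged


if __name__ == "__main__":
    print(codons_where_n_allows_stop("ATGTANGCC"))
```

A codon such as TAN is flagged because N = A or G would give TAA or TAG, whereas GCN is not flagged because no base at the third position produces a stop codon.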

In sum, a polynucleotide of the invention may encode a mature protein, a mature protein 
plus a leader sequence (which may be referred to as a preprotein), a precursor of a mature 
protein having one or more prosequences that are not the leader sequences of a preprotein, 
or a preproprotein, which is a precursor to a proprotein, having a leader sequence and one or 
more prosequences, which generally are removed during processing steps that produce 
active and mature forms of the polypeptide. 

In accordance with an aspect of the invention, there is provided the use of a 
polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
genetic immunization. 

The use of a polynucleotide of the invention in genetic immunization will preferably 
employ a suitable delivery method such as direct injection of plasmid DNA into muscles 
(Wolff et al., Hum Mol Genet (1992) 1: 363, Manthorpe et al., Hum. Gene Ther. (1993) 4: 
419), delivery of DNA complexed with specific protein carriers (Wu et al., J Biol Chem. 
(1989) 264: 16985), coprecipitation of DNA with calcium phosphate (Benvenisty & 
Reshef, PNAS USA, (1986) 83: 9551), encapsulation of DNA in various forms of 
liposomes (Kaneda et al., Science (1989) 243: 375), particle bombardment (Tang et al., 
Nature (1992) 356:152, Eisenbraun et al., DNA Cell Biol (1993) 12: 791) and in vivo 
infection using cloned retroviral vectors (Seeger et al., PNAS USA (1984) 81: 5849). 



Vectors, Host Cells, Expression Systems 

The invention also relates to vectors that comprise a polynucleotide or polynucleotides of 
the invention, host cells that are genetically engineered with vectors of the invention and the 
production of polypeptides of the invention by recombinant techniques. Cell-free 
translation systems can also be employed to produce such proteins using RNAs derived 
from the DNA constructs of the invention. 

Recombinant polypeptides of the present invention may be prepared by processes well 
known to those skilled in the art from genetically engineered host cells comprising 
expression systems. Accordingly, in a further aspect, the present invention relates to 
expression systems that comprise a polynucleotide or polynucleotides of the present 
invention, to host cells which are genetically engineered with such expression systems, and 
to the production of polypeptides of the invention by recombinant techniques. 

For recombinant production of the polypeptides of the invention, host cells can be 
genetically engineered to incorporate expression systems or portions thereof or 
polynucleotides of the invention. Introduction of a polynucleotide into the host cell can be 
effected by methods described in many standard laboratory manuals, such as Davis, et al., 
BASIC METHODS IN MOLECULAR BIOLOGY, (1986) and Sambrook, et al., 
MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. (1989), such as, calcium phosphate 
transfection, DEAE-dextran mediated transfection, transvection, microinjection, cationic 
lipid-mediated transfection, electroporation, transduction, scrape loading, ballistic 
introduction and infection. 

Representative examples of appropriate hosts include bacterial cells, such as cells of 
streptococci, staphylococci, enterococci, E. coli, streptomyces, cyanobacteria, Bacillus 
subtilis, Neisseria meningitidis and Moraxella catarrhalis; fungal cells, such as cells of a 
yeast, Kluveromyces, Saccharomyces, a basidiomycete, Candida albicans and Aspergillus; 
insect cells such as cells of Drosophila S2 and Spodoptera Sf9; animal cells such as CHO, 
COS, HeLa, C127, 3T3, BHK, 293, CV-1 and Bowes melanoma cells; and plant cells, such 
as cells of a gymnosperm or angiosperm. 

A great variety of expression systems can be used to produce the polypeptides of the 
invention. Such vectors include, among others, chromosomal-, episomal- and virus-derived 
vectors, for example, vectors derived from bacterial plasmids, from bacteriophage, from 
transposons, from yeast episomes, from insertion elements, from yeast chromosomal 
elements, from viruses such as baculoviruses, papova viruses, such as SV40, vaccinia 
viruses, adenoviruses, fowl pox viruses, pseudorabies viruses, picornaviruses, retroviruses, 
and alphaviruses and vectors derived from combinations thereof, such as those derived from 
plasmid and bacteriophage genetic elements, such as cosmids and phagemids. The 
expression system constructs may contain control regions that regulate as well as engender 
expression. Generally, any system or vector suitable to maintain, propagate or express 
polynucleotides and/or to express a polypeptide in a host may be used for expression in this 
regard. The appropriate DNA sequence may be inserted into the expression system by any 
of a variety of well-known and routine techniques, such as, for example, those set forth in 
Sambrook et al., MOLECULAR CLONING, A LABORATORY MANUAL, (supra). 

In recombinant expression systems in eukaryotes, for secretion of a translated protein into 
the lumen of the endoplasmic reticulum, into the periplasmic space or into the extracellular 
environment, appropriate secretion signals may be incorporated into the expressed 
polypeptide. These signals may be endogenous to the polypeptide or they may be 
heterologous signals. 

Polypeptides of the present invention can be recovered and purified from recombinant 
cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity chromatography, 
hydroxylapatite chromatography and lectin chromatography. Most preferably, ion metal 
affinity chromatography (IMAC) is employed for purification. Well known techniques 
for refolding proteins may be employed to regenerate active conformation when the 
polypeptide is denatured during intracellular synthesis, isolation and/or purification. 

The expression system may also be a recombinant live microorganism, such as a virus 
or bacterium. The gene of interest can be inserted into the genome of a live recombinant 
virus or bacterium. Inoculation and in vivo infection with this live vector will lead to in 
vivo expression of the antigen and induction of immune responses. Viruses and bacteria 
used for this purpose are for instance: poxviruses (e.g., vaccinia, fowlpox, canarypox), 
alphaviruses (Sindbis virus, Semliki Forest Virus, Venezuelan Equine Encephalitis 
Virus), adenoviruses, adeno-associated virus, picornaviruses (poliovirus, rhinovirus), 
herpesviruses (varicella zoster virus, etc.), Listeria, Salmonella, Shigella, BCG. These 
viruses and bacteria can be virulent, or attenuated in various ways in order to obtain live 
vaccines. Such live vaccines also form part of the invention. 

Diagnostic, Prognostic, Serotyping and Mutation Assays 

This invention is also related to the use of BASB027 polynucleotides and polypeptides of 
the invention for use as diagnostic reagents. Detection of BASB027 polynucleotides and/or 
polypeptides in a eukaryote, particularly a mammal, and especially a human, will provide a 
diagnostic method for diagnosis of disease, staging of disease or response of an infectious 
organism to drugs. Eukaryotes, particularly mammals, and especially humans, particularly 
those infected or suspected to be infected with an organism comprising the BASB027 gene 
or protein, may be detected at the nucleic acid or amino acid level by a variety of well 
known techniques as well as by methods provided herein. 

Polypeptides and polynucleotides for prognosis, diagnosis or other analysis may be obtained 
from a putatively infected and/or infected individual's bodily materials. Polynucleotides 
from any of these sources, particularly DNA or RNA, may be used directly for detection or 
may be amplified enzymatically by using PCR or any other amplification technique prior to 
analysis. RNA, particularly mRNA, cDNA and genomic DNA may also be used in the 
same ways. Using amplification, characterization of the species and strain of infectious or 
resident organism present in an individual, may be made by an analysis of the genotype of a 
selected polynucleotide of the organism. Deletions and insertions can be detected by a 
change in size of the amplified product in comparison to a genotype of a reference sequence 
selected from a related organism, preferably a different species of the same genus or a 
different strain of the same species. Point mutations can be identified by hybridizing 
amplified DNA to labeled BASB027 polynucleotide sequences. Perfectly or significantly 
matched sequences can be distinguished from imperfectly or more significantly mismatched 
duplexes by DNase or RNase digestion, for DNA or RNA respectively, or by detecting 
differences in melting temperatures or renaturation kinetics. Polynucleotide sequence 
differences may also be detected by alterations in the electrophoretic mobility of 
polynucleotide fragments in gels as compared to a reference sequence. This may be carried 
out with or without denaturing agents. Polynucleotide differences may also be detected by 
direct DNA or RNA sequencing. See, for example, Myers et al., Science, 230: 1242 (1985). 
Sequence changes at specific locations also may be revealed by nuclease protection assays, 
such as RNase, V1 and S1 protection assay or a chemical cleavage method. See, for 
example, Cotton et al., Proc. Natl. Acad. Sci. USA, 85: 4397-4401 (1985). 
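
The size comparison described above for detecting deletions and insertions can be sketched as follows. This is an illustration only: the function name `classify_amplicon`, the `tolerance_bp` parameter (standing in for the sizing resolution of whatever gel or instrument is used) and the example lengths are assumptions and are not part of the disclosure.

```python
def classify_amplicon(reference_bp: int, sample_bp: int, tolerance_bp: int = 0):
    """Compare an amplified product with a reference amplicon by length.

    Returns a (kind, size) tuple, where kind is "insertion", "deletion"
    or "no size change" and size is the absolute difference in base pairs.
    tolerance_bp is a hypothetical allowance for sizing error.
    """
    difference = sample_bp - reference_bp
    if abs(difference) <= tolerance_bp:
        return ("no size change", 0)
    if difference > 0:
        return ("insertion", difference)
    return ("deletion", -difference)


if __name__ == "__main__":
    # Hypothetical lengths: a 500 bp reference and a 470 bp sample product.
    print(classify_amplicon(500, 470))
```

A size comparison of this kind cannot reveal point mutations, which leave the product length unchanged; those require the hybridization, melting, mobility or sequencing approaches described above.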

In another embodiment, an array of oligonucleotide probes comprising BASB027 
nucleotide sequence or fragments thereof can be constructed to conduct efficient screening 
of, for example, genetic mutations, serotype, taxonomic classification or identification. 
Array technology methods are well known and have general applicability and can be used to 
address a variety of questions in molecular genetics including gene expression, genetic 
linkage, and genetic variability (see, for example, Chee et al., Science, 274: 610 (1996)). 

Thus in another aspect, the present invention relates to a diagnostic kit which comprises: 

(a) a polynucleotide of the present invention, preferably the nucleotide sequence of SEQ 
ID NO: 1 or 3, or a fragment thereof; 

(b) a nucleotide sequence complementary to that of (a); 

(c) a polypeptide of the present invention, preferably the polypeptide of SEQ ID NO:2 or 
4 or a fragment thereof; or 

(d) an antibody to a polypeptide of the present invention, preferably to the polypeptide of 
SEQ ID NO:2 or 4. 

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial 
component. Such a kit will be of use in diagnosing a disease or susceptibility to a 
disease, among others. 

This invention also relates to the use of polynucleotides of the present invention as 
diagnostic reagents. Detection of a mutated form of a polynucleotide of the invention, 
preferably SEQ ID NO:1 or 3, which is associated with a disease or pathogenicity will 
provide a diagnostic tool that can add to, or define, a diagnosis of a disease, a prognosis of a 
course of disease, a determination of a stage of disease, or a susceptibility to a disease, 
which results from under-expression, over-expression or altered expression of the 
polynucleotide. Organisms, particularly infectious organisms, carrying mutations in such 
polynucleotide may be detected at the polynucleotide level by a variety of techniques, such 
as those described elsewhere herein. 

Cells from an organism carrying mutations or polymorphisms (allelic variations) in a 
polynucleotide and/or polypeptide of the invention may also be detected at the 
polynucleotide or polypeptide level by a variety of techniques, to allow for serotyping, for 
example. For example, RT-PCR can be used to detect mutations in the RNA. It is 
particularly preferred to use RT-PCR in conjunction with automated detection systems, such 
as, for example, GeneScan. RNA, cDNA or genomic DNA may also be used for the same 
purpose, PCR. As an example, PCR primers complementary to a polynucleotide encoding 
BASB027 polypeptide can be used to identify and analyze mutations. 

The invention further provides primers with 1, 2, 3 or 4 nucleotides removed from the 5' 
and/or the 3' end. These primers may be used for, among other things, amplifying 
BASB027 DNA and/or RNA isolated from a sample derived from an individual, such as a 
bodily material. The primers may be used to amplify a polynucleotide isolated from an 
infected individual, such that the polynucleotide may then be subject to various techniques 
for elucidation of the polynucleotide sequence. In this way, mutations in the polynucleotide 
sequence may be detected and used to diagnose and/or prognose the infection or its stage or 
course, or to serotype and/or classify the infectious agent. 
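
As an illustration of the primers with 1, 2, 3 or 4 nucleotides removed from the 5' and/or the 3' end, the following minimal sketch enumerates every such variant of a given primer. The function name `trimmed_primer_variants` and the 20-nucleotide example primer are hypothetical; the example primer is not derived from SEQ ID NO:1 or 3.

```python
def trimmed_primer_variants(primer: str, max_trim: int = 4) -> dict:
    """Return primers with 0..max_trim nucleotides removed from each end.

    Keys are (removed_from_5_prime, removed_from_3_prime); the untrimmed
    primer (0, 0) is excluded, and empty results are skipped.
    """
    variants = {}
    for five in range(max_trim + 1):
        for three in range(max_trim + 1):
            if five == 0 and three == 0:
                continue  # the original primer itself is not a variant
            end = len(primer) - three
            if end - five <= 0:
                continue  # primer too short for this combination
            variants[(five, three)] = primer[five:end]
    return variants


if __name__ == "__main__":
    example_primer = "ACGTACGTACGTACGTACGT"  # hypothetical 20-mer
    for (five, three), seq in sorted(trimmed_primer_variants(example_primer).items()):
        print(five, three, seq)
```

For a primer longer than eight nucleotides this yields 24 variants, that is, every combination of 0 to 4 nucleotides removed from each end other than the untrimmed primer.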

The invention further provides a process for diagnosing disease, preferably bacterial 
infections, more preferably infections caused by Moraxella catarrhalis, comprising 
determining from a sample derived from an individual, such as a bodily material, an 
increased level of expression of polynucleotide having a sequence of SEQ ID NO: 1 or 3. 
Increased or decreased expression of a BASB027 polynucleotide can be measured using 
any one of the methods well known in the art for the quantitation of polynucleotides, such 
as, for example, amplification, PCR, RT-PCR, RNase protection, Northern blotting, 
spectrometry and other hybridization methods. 
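
Once a level of expression has been quantitated by any such method, an increase or decrease relative to a control can be expressed as a simple ratio. The sketch below is illustrative only: the function names, the example values and the two-fold threshold are assumptions and are not specified herein.

```python
def expression_fold_change(sample_level: float, control_level: float) -> float:
    """Ratio of the sample's measured level to the control's measured level."""
    if control_level <= 0:
        raise ValueError("control level must be positive")
    return sample_level / control_level


def is_increased(sample_level: float, control_level: float,
                 threshold: float = 2.0) -> bool:
    """True if the fold change meets a threshold.

    The default threshold of 2.0 is a hypothetical cut-off chosen for
    illustration; a real assay would establish its own.
    """
    return expression_fold_change(sample_level, control_level) >= threshold


if __name__ == "__main__":
    # Hypothetical normalized signal values for a sample and a control.
    print(expression_fold_change(8.0, 2.0), is_increased(8.0, 2.0))
```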

In addition, a diagnostic assay in accordance with the invention for detecting over- 
expression of BASB027 polypeptide compared to normal control tissue samples may be 
used to detect the presence of an infection, for example. Assay techniques that can be used 
to determine levels of a BASB027 polypeptide, in a sample derived from a host, such as a 
bodily material, are well-known to those of skill in the art. Such assay methods include 
radioimmunoassays, competitive-binding assays, Western Blot analysis, antibody sandwich 
assays, antibody detection and ELISA assays. 

The polynucleotides of the invention may be used as components of polynucleotide 
arrays, preferably high density arrays or grids. These high density arrays are 
particularly useful for diagnostic and prognostic purposes. For example, a set of spots 
each comprising a different gene, and further comprising a polynucleotide or 
polynucleotides of the invention, may be used for probing, such as using hybridization 
or nucleic acid amplification, using probes obtained or derived from a bodily sample, 
to determine the presence of a particular polynucleotide sequence or related sequence in 
an individual. Such a presence may indicate the presence of a pathogen, particularly 
Moraxella catarrhalis, and may be useful in diagnosing and/or prognosing disease or a 
course of disease. A grid comprising a number of variants of the polynucleotide 
sequence of SEQ ID NO: 1 or 3 is preferred. Also preferred is a grid comprising a number 
of variants of a polynucleotide sequence encoding the polypeptide sequence of SEQ ID 
NO:2 or 4. 

Antibodies 

The polypeptides and polynucleotides of the invention or variants thereof, or cells 
expressing the same can be used as immunogens to produce antibodies immunospecific for 
such polypeptides or polynucleotides respectively. 

In certain preferred embodiments of the invention there are provided antibodies against 
BASB027 polypeptides or polynucleotides. 

Antibodies generated against the polypeptides or polynucleotides of the invention can be 
obtained by administering the polypeptides and/or polynucleotides of the invention, or 
epitope-bearing fragments of either or both, analogues of either or both, or cells expressing 
either or both, to an animal, preferably a nonhuman, using routine protocols. For 
preparation of monoclonal antibodies, any technique known in the art that provides 
antibodies produced by continuous cell line cultures can be used. Examples include various 
techniques, such as those in Kohler, G. and Milstein, C., Nature 256: 495-497 (1975); 
Kozbor et al., Immunology Today 4: 72 (1983); Cole et al., pg. 77-96 in MONOCLONAL 
ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc. (1985). 

Techniques for the production of single chain antibodies (U.S. Patent No. 4,946,778) can be 
adapted to produce single chain antibodies to polypeptides or polynucleotides of this 
invention. Also, transgenic mice, or other organisms or animals, such as other mammals, 
may be used to express humanized antibodies immunospecific to the polypeptides or 
polynucleotides of the invention. 

Alternatively, phage display technology may be utilized to select antibody genes with 
binding activities towards a polypeptide of the invention either from repertoires of PCR 
amplified v-genes of lymphocytes from humans screened for possessing anti-BASB027 or 
from naive libraries (McCafferty, et al., (1990), Nature 348, 552-554; Marks, et al., 
(1992) Biotechnology 10, 779-783). The affinity of these antibodies can also be improved 
by, for example, chain shuffling (Clackson et al., (1991) Nature 352: 628). 



The above-described antibodies may be employed to isolate or to identify clones expressing 
the polypeptides or polynucleotides of the invention to purify the polypeptides or 
polynucleotides by, for example, affinity chromatography. 

Thus, among others, antibodies against BASB027-polypeptide or BASB027-polynucleotide 
may be employed to treat infections, particularly bacterial infections. 

Polypeptide variants including antigenically, epitopically or immunologically equivalent 
variants form a particular aspect of this invention. 

Preferably, the antibody or variant thereof is modified to make it less immunogenic in the 
individual. For example, if the individual is human the antibody may most preferably be 
"humanized," where the complementarity determining region or regions of the hybridoma- 
derived antibody has been transplanted into a human monoclonal antibody, for example as 
described in Jones et al. (1986), Nature 321, 522-525 or Tempest et al., (1991) 
Biotechnology 9, 266-273. 

Antagonists and Agonists - Assays and Molecules 

Polypeptides and polynucleotides of the invention may also be used to assess the binding of 
small molecule substrates and ligands in, for example, cells, cell-free preparations, chemical 
libraries, and natural product mixtures. These substrates and ligands may be natural 
substrates and ligands or may be structural or functional mimetics. See, e.g., Coligan et al., 
Current Protocols in Immunology 1 (2): Chapter 5 (1991). 

The screening methods may simply measure the binding of a candidate compound to the 
polypeptide or polynucleotide, or to cells or membranes bearing the polypeptide or 
polynucleotide, or a fusion protein of the polypeptide by means of a label directly or 
indirectly associated with the candidate compound. Alternatively, the screening method 
may involve competition with a labeled competitor. Further, these screening methods 
may test whether the candidate compound results in a signal generated by activation or 
inhibition of the polypeptide or polynucleotide, using detection systems appropriate to the 
cells comprising the polypeptide or polynucleotide. Inhibitors of activation are generally 
assayed in the presence of a known agonist and the effect on activation by the agonist by 
the presence of the candidate compound is observed. Constitutively active polypeptide 
and/or constitutively expressed polypeptides and polynucleotides may be employed in 
screening methods for inverse agonists or inhibitors, in the absence of an agonist or 
inhibitor, by testing whether the candidate compound results in inhibition of activation of 
the polypeptide or polynucleotide, as the case may be. Further, the screening methods 
may simply comprise the steps of mixing a candidate compound with a solution 
containing a polypeptide or polynucleotide of the present invention, to form a mixture, 
measuring BASB027 polypeptide and/or polynucleotide activity in the mixture, and 
comparing the BASB027 polypeptide and/or polynucleotide activity of the mixture to a 
standard. Fusion proteins, such as those made from Fc portion and BASB027 
polypeptide, as hereinbefore described, can also be used for high-throughput screening 
assays to identify antagonists of the polypeptide of the present invention, as well as of 
phylogenetically and/or functionally related polypeptides (see D. Bennett et al., J Mol 
Recognition, 8:52-58 (1995); and K. Johanson et al., J Biol Chem, 270(16):9459-9471 
(1995)). 

The polynucleotides, polypeptides and antibodies that bind to and/or interact with a 
polypeptide of the present invention may also be used to configure screening methods for 
detecting the effect of added compounds on the production of mRNA and/or polypeptide 
in cells. For example, an ELISA assay may be constructed for measuring secreted or cell 
associated levels of polypeptide using monoclonal and polyclonal antibodies by standard 
methods known in the art. This can be used to discover agents which may inhibit or 
enhance the production of polypeptide (also called antagonist or agonist, respectively) 
from suitably manipulated cells or tissues. 

The invention also provides a method of screening compounds to identify those which 
enhance (agonist) or block (antagonist) the action of BASB027 polypeptides or 
polynucleotides, particularly those compounds that are bacteriostatic and/or bactericidal. 
The method of screening may involve high-throughput techniques. For example, to screen 
for agonists or antagonists, a synthetic reaction mix, a cellular compartment, such as a 
membrane, cell envelope or cell wall, or a preparation of any thereof, comprising BASB027 
polypeptide and a labeled substrate or ligand of such polypeptide is incubated in the absence 
or the presence of a candidate molecule that may be a BASB027 agonist or antagonist. The 
ability of the candidate molecule to agonize or antagonize the BASB027 polypeptide is 
reflected in decreased binding of the labeled ligand or decreased production of product from 
such substrate. Molecules that bind gratuitously, i.e., without inducing the effects of 
BASB027 polypeptide are most likely to be good antagonists. Molecules that bind well 
and, as the case may be, increase the rate of product production from substrate, increase 
signal transduction, or increase chemical channel activity are agonists. Detection of the rate 
or level of, as the case may be, production of product from substrate, signal transduction, or 
chemical channel activity may be enhanced by using a reporter system. Reporter systems 
that may be useful in this regard include but are not limited to colorimetric, labeled substrate 
converted into product, a reporter gene that is responsive to changes in BASB027 
polynucleotide or polypeptide activity, and binding assays known in the art. 

Another example of an assay for BASB027 agonists is a competitive assay that combines 
BASB027 and a potential agonist with BASB027-binding molecules, recombinant 
BASB027 binding molecules, natural substrates or ligands, or substrate or ligand mimetics, 
under appropriate conditions for a competitive inhibition assay. BASB027 can be labeled, 
such as by radioactivity or a colorimetric compound, such that the number of BASB027 
molecules bound to a binding molecule or converted to product can be determined 
accurately to assess the effectiveness of the potential antagonist. 

Potential antagonists include, among others, small organic molecules, peptides, polypeptides 
and antibodies that bind to a polynucleotide and/or polypeptide of the invention and thereby 
inhibit or extinguish its activity or expression. Potential antagonists also may be small 
organic molecules, a peptide, a polypeptide such as a closely related protein or antibody that 
binds the same sites on a binding molecule, such as a binding molecule, without inducing 
BASB027-induced activities, thereby preventing the action or expression of BASB027 
polypeptides and/or polynucleotides by excluding BASB027 polypeptides and/or 
polynucleotides from binding. 

Potential antagonists include a small molecule that binds to and occupies the binding site of 
the polypeptide thereby preventing binding to cellular binding molecules, such that normal 
biological activity is prevented. Examples of small molecules include but are not limited to 
small organic molecules, peptides or peptide-like molecules. Other potential antagonists 
include antisense molecules (see Okano, J. Neurochem. 56: 560 (1991); 
OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE EXPRESSION, 
CRC Press, Boca Raton, FL (1988), for a description of these molecules). Preferred 
potential antagonists include compounds related to and variants of BASB027. 

In a further aspect, the present invention relates to genetically engineered soluble fusion 
proteins comprising a polypeptide of the present invention, or a fragment thereof, and 
various portions of the constant regions of heavy or light chains of immunoglobulins of 
various subclasses (IgG, IgM, IgA, IgE). Preferred as an immunoglobulin is the constant 
part of the heavy chain of human IgG, particularly IgG1, where fusion takes place at the 
hinge region. In a particular embodiment, the Fc part can be removed simply by 
incorporation of a cleavage sequence which can be cleaved with blood clotting factor Xa. 
Furthermore, this invention relates to processes for the preparation of these fusion 
proteins by genetic engineering, and to the use thereof for drug screening, diagnosis and 
therapy. A further aspect of the invention also relates to polynucleotides encoding such 
fusion proteins. Examples of fusion protein technology can be found in International 
Patent Application Nos. WO94/29458 and WO94/22914. 

Each of the polynucleotide sequences provided herein may be used in the discovery and 
development of antibacterial compounds. The encoded protein, upon expression, can be 
used as a target for the screening of antibacterial drugs. Additionally, the polynucleotide 
sequences encoding the amino terminal regions of the encoded protein or Shine-Dalgarno 
or other translation facilitating sequences of the respective mRNA can be used to 
construct antisense sequences to control the expression of the coding sequence of interest. 

The invention also provides the use of the polypeptide, polynucleotide, agonist or 
antagonist of the invention to interfere with the initial physical interaction between a 
pathogen or pathogens and a eukaryotic, preferably mammalian, host responsible for 
sequelae of infection. In particular, the molecules of the invention may be used: in the 
prevention of adhesion of bacteria, in particular gram positive and/or gram negative 
bacteria, to eukaryotic, preferably mammalian, extracellular matrix proteins on in- 
dwelling devices or to extracellular matrix proteins in wounds; to block bacterial adhesion 
between eukaryotic, preferably mammalian, extracellular matrix proteins and bacterial 
BASB027 proteins that mediate tissue damage and/or; to block the normal progression of 
pathogenesis in infections initiated other than by the implantation of in-dwelling devices 
or by other surgical techniques. 

In accordance with yet another aspect of the invention, there are provided BASB027 
agonists and antagonists, preferably bacteriostatic or bactericidal agonists and antagonists. 

The antagonists and agonists of the invention may be employed, for instance, to prevent, 
inhibit and/or treat diseases. 

In a further aspect, the present invention relates to mimotopes of the polypeptide of the 
invention. A mimotope is a peptide sequence, sufficiently similar to the native peptide 
(sequentially or structurally), which is capable of being recognised by antibodies which 
recognise the native peptide; or is capable of raising antibodies which recognise the 
native peptide when coupled to a suitable carrier. 

Peptide mimotopes may be designed for a particular purpose by addition, deletion or 
substitution of selected amino acids. Thus, the peptides may be modified for the purposes 
of ease of conjugation to a protein carrier. For example, it may be desirable for some 
chemical conjugation methods to include a terminal cysteine. In addition it may be 
desirable for peptides conjugated to a protein carrier to include a hydrophobic terminus 
distal from the conjugated terminus of the peptide, such that the free unconjugated end 
of the peptide remains associated with the surface of the carrier protein. Thereby- 
presenting the peptide in a conformation which most closely resembles that of the 
peptide as found in the context of the whole native molecule. For example, the peptides 
may be altered to have an N-terminal cysteine and a C-terminal hydrophobic amidated 
tail. Alternatively, the addition or substitution of a D-stereoisomer form of one or more 
of the amino acids may be performed to create a beneficial derivative, for example to,, 
enhance stability of the peptide. 

Alternatively, peptide mimotopes may be identified using antibodies which are capable 
themselves of binding to the polypeptides of the present invention using techniques such 
as phage display technology (EP 0 552 267 B1). This technique generates a large number 
of peptide sequences which mimic the structure of the native peptides and are, therefore, 
capable of binding to anti-native peptide antibodies, but may not necessarily themselves 
share significant sequence homology to the native polypeptide. 

Vaccines 

Another aspect of the invention relates to a method for inducing an immunological 
response in an individual, particularly a mammal, preferably humans, which comprises 
inoculating the individual with BASB027 polynucleotide and/or polypeptide, or a 
fragment or variant thereof, adequate to produce antibody and/or T cell immune response 
to protect said individual from infection, particularly bacterial infection and most 
particularly Moraxella catarrhalis infection. Also provided are methods whereby such 
immunological response slows bacterial replication. Yet another aspect of the invention 
relates to a method of inducing immunological response in an individual which comprises 
delivering to such individual a nucleic acid vector, sequence or ribozyme to direct 
expression of BASB027 polynucleotide and/or polypeptide, or a fragment or a variant 
thereof, for expressing BASB027 polynucleotide and/or polypeptide, or a fragment or a 
variant thereof in vivo in order to induce an immunological response, such as to produce 
antibody and/or T cell immune response, including, for example, cytokine-producing T 
cells or cytotoxic T cells, to protect said individual, preferably a human, from disease, 
whether that disease is already established within the individual or not. One example of 
administering the gene is by accelerating it into the desired cells as a coating on particles 
or otherwise. Such nucleic acid vector may comprise DNA, RNA, a ribozyme, a modified 
nucleic acid, a DNA/RNA hybrid, a DNA-protein complex or an RNA-protein complex. 

A further aspect of the invention relates to an immunological composition that when 
introduced into an individual, preferably a human, capable of having induced within it an 
immunological response, induces an immunological response in such individual to a 
BASB027 polynucleotide and/or polypeptide encoded therefrom, wherein the composition 
comprises a recombinant BASB027 polynucleotide and/or polypeptide encoded therefrom 
and/or comprises DNA and/or RNA which encodes and expresses an antigen of said 
BASB027 polynucleotide, polypeptide encoded therefrom, or other polypeptide of the 
invention. The immunological response may be used therapeutically or prophylactically 
and may take the form of antibody immunity and/or cellular immunity, such as cellular 
immunity arising from CTL or CD4+ T cells. 

A BASB027 polypeptide or a fragment thereof may be fused with a co-protein or chemical 
moiety which may or may not by itself produce antibodies, but which is capable of 
stabilizing the first protein and producing a fused or modified protein which will have 
antigenic and/or immunogenic properties, and preferably protective properties. Thus, the 
fused recombinant protein preferably further comprises an antigenic co-protein, such as 
lipoprotein D from Haemophilus influenzae, Glutathione-S-transferase (GST) or 
beta-galactosidase, or any other relatively large co-protein which solubilizes the protein and 
facilitates production and purification thereof. Moreover, the co-protein may act as an 
adjuvant in the sense of providing a generalized stimulation of the immune system of the 
organism receiving the protein. The co-protein may be attached to either the amino- or 
carboxy-terminus of the first protein. 

Provided by this invention are compositions, particularly vaccine compositions, and 
methods comprising the polypeptides and/or polynucleotides of the invention and 
immunostimulatory DNA sequences, such as those described in Sato, Y. et al., Science 
273: 352 (1996). 

Also provided by this invention are methods using the described polynucleotide or 
particular fragments thereof, which have been shown to encode non-variable regions of 
bacterial cell surface proteins, in polynucleotide constructs used in such genetic 
immunization experiments in animal models of infection with Moraxella catarrhalis. 
Such experiments will be particularly useful for identifying protein epitopes able to 
provoke a prophylactic or therapeutic immune response. It is believed that this approach 
will allow for the subsequent preparation of monoclonal antibodies of particular value, 
derived from the requisite organ of the animal successfully resisting or clearing infection, 
for the development of prophylactic agents or therapeutic treatments of bacterial infection, 
particularly Moraxella catarrhalis infection, in mammals, particularly humans. 

The invention also includes a vaccine formulation which comprises an immunogenic 
recombinant polypeptide and/or polynucleotide of the invention together with a suitable 
carrier, such as a pharmaceutically acceptable carrier. Since the polypeptides and 
polynucleotides may be broken down in the stomach, each is preferably administered 
parenterally, including, for example, administration that is subcutaneous, intramuscular, 
intravenous, or intradermal. Formulations suitable for parenteral administration include 
aqueous and non-aqueous sterile injection solutions which may contain anti-oxidants, 
buffers, bacteriostatic compounds and solutes which render the formulation isotonic with 
the bodily fluid, preferably the blood, of the individual; and aqueous and non-aqueous 
sterile suspensions which may include suspending agents or thickening agents. The 
formulations may be presented in unit-dose or multi-dose containers, for example, sealed 
ampoules and vials and may be stored in a freeze-dried condition requiring only the 
addition of the sterile liquid carrier immediately prior to use. 

The vaccine formulation of the invention may also include adjuvant systems for 
enhancing the immunogenicity of the formulation. Preferably the adjuvant system 
raises preferentially a TH1 type of response. 

An immune response may be broadly distinguished into two extreme categories, being 
humoral or cell-mediated immune responses (traditionally characterised by antibody and 
cellular effector mechanisms of protection respectively). These categories of response 
have been termed TH1-type responses (cell-mediated response), and TH2-type immune 
responses (humoral response). 

Extreme TH1-type immune responses may be characterised by the generation of antigen 
specific, haplotype restricted cytotoxic T lymphocytes, and natural killer cell responses. 
In mice TH1-type responses are often characterised by the generation of antibodies of 
the IgG2a subtype, whilst in the human these correspond to IgG1 type antibodies. 
TH2-type immune responses are characterised by the generation of a broad range of 
immunoglobulin isotypes including in mice IgG1, IgA, and IgM. 

It can be considered that the driving force behind the development of these two types of 
immune responses is cytokines. High levels of TH1-type cytokines tend to favour the 
induction of cell mediated immune responses to the given antigen, whilst high levels of 
TH2-type cytokines tend to favour the induction of humoral immune responses to the 
antigen. 

The distinction of TH1 and TH2-type immune responses is not absolute. In reality an 
individual will support an immune response which is described as being predominantly 
TH1 or predominantly TH2. However, it is often convenient to consider the families of 
cytokines in terms of that described in murine CD4 +ve T cell clones by Mosmann and 
Coffman (Mosmann, T.R. and Coffman, R.L. (1989) TH1 and TH2 cells: different 
patterns of lymphokine secretion lead to different functional properties. Annual Review 
of Immunology, 7, p145-173). Traditionally, TH1-type responses are associated with 
the production of the IFN-γ and IL-2 cytokines by T-lymphocytes. Other cytokines 
often directly associated with the induction of TH1-type immune responses are not 
produced by T-cells, such as IL-12. In contrast TH2-type responses are associated with 
the secretion of IL-4, IL-5, IL-6 and IL-13. 

It is known that certain vaccine adjuvants are particularly suited to the stimulation of 
either TH1 or TH2-type cytokine responses. Traditionally the best indicators of the 
TH1:TH2 balance of the immune response after a vaccination or infection include 
direct measurement of the production of TH1 or TH2 cytokines by T lymphocytes in 
vitro after restimulation with antigen, and/or the measurement of the IgG1:IgG2a ratio 
of antigen specific antibody responses. 

Thus, a TH1-type adjuvant is one which preferentially stimulates isolated T-cell 
populations to produce high levels of TH1-type cytokines when re-stimulated with 
antigen in vitro, and promotes development of both CD8+ cytotoxic T lymphocytes and 
antigen specific immunoglobulin responses associated with TH1-type isotype. 

Adjuvants which are capable of preferential stimulation of the TH1 cell response are 
described in International Patent Application No. WO 94/00153 and WO 95/17209. 

3 De-O-acylated monophosphoryl lipid A (3D-MPL) is one such adjuvant. This is 
known from GB 2220211 (Ribi). Chemically it is a mixture of 3 De-O-acylated 
monophosphoryl lipid A with 4, 5 or 6 acylated chains and is manufactured by Ribi 
Immunochem, Montana. A preferred form of 3 De-O-acylated monophosphoryl lipid 
A is disclosed in European Patent 0 689 454 B1 (SmithKline Beecham Biologicals SA). 

Preferably, the particles of 3D-MPL are small enough to be sterile filtered through a 
0.22 micron membrane (European Patent number 0 689 454). 
3D-MPL will be present in the range of 10 µg - 100 µg, preferably 25-50 µg per dose, 
wherein the antigen will typically be present in a range 2-50 µg per dose. 

Another preferred adjuvant comprises QS21, an HPLC purified non-toxic fraction derived 
from the bark of Quillaja Saponaria Molina. Optionally this may be admixed with 3 
De-O-acylated monophosphoryl lipid A (3D-MPL), optionally together with a carrier. 

The method of production of QS21 is disclosed in US patent No. 5,057,540. 

Non-reactogenic adjuvant formulations containing QS21 have been described 
previously (WO 96/33739). Such formulations comprising QS21 and cholesterol have 
been shown to be successful TH1 stimulating adjuvants when formulated together with 
an antigen. 

Further adjuvants which are preferential stimulators of TH1 cell response include 
immunomodulatory oligonucleotides, for example unmethylated CpG sequences as 
disclosed in WO 96/02555. 

Combinations of different TH1 stimulating adjuvants, such as those mentioned 
hereinabove, are also contemplated as providing an adjuvant which is a preferential 
stimulator of TH1 cell response. For example, QS21 can be formulated together with 
3D-MPL. The ratio of QS21 : 3D-MPL will typically be in the order of 1 : 10 to 10 : 1; 
preferably 1 : 5 to 5 : 1 and often substantially 1 : 1. The preferred range for optimal 
synergy is 2.5 : 1 to 1 : 1 3D-MPL : QS21. 

Preferably a carrier is also present in the vaccine composition according to the 
invention. The carrier may be an oil in water emulsion, or an aluminium salt, such as 
aluminium phosphate or aluminium hydroxide. 

A preferred oil-in-water emulsion comprises a metabolisible oil, such as squalene, alpha 
tocopherol and Tween 80. In a particularly preferred aspect the antigens in the vaccine 
composition according to the invention are combined with QS21 and 3D-MPL in such 
an emulsion. Additionally the oil in water emulsion may contain span 85 and/or lecithin 
and/or tricaprylin. 

Typically for human administration QS21 and 3D-MPL will be present in a vaccine in 
the range of 1 µg - 200 µg, such as 10-100 µg, preferably 10 µg - 50 µg per dose. 
Typically the oil in water will comprise from 2 to 10% squalene, from 2 to 10% alpha 
tocopherol and from 0.3 to 3% Tween 80. Preferably the ratio of squalene: alpha 
tocopherol is equal to or less than 1 as this provides a more stable emulsion. Span 85 
may also be present at a level of 1%. In some cases it may be advantageous that the 
vaccines of the present invention will further contain a stabiliser. 

Non-toxic oil in water emulsions preferably contain a non-toxic oil, e.g. squalane or 
squalene, an emulsifier, e.g. Tween 80, in an aqueous carrier. The aqueous carrier may 
be, for example, phosphate buffered saline. 

A particularly potent adjuvant formulation involving QS21, 3D-MPL and tocopherol 
in an oil in water emulsion is described in WO 95/17210. 

The present invention also provides a polyvalent vaccine composition comprising a 
vaccine formulation of the invention in combination with other antigens, in particular 
antigens useful for treating cancers, autoimmune diseases and related conditions. Such a 
polyvalent vaccine composition may include a TH-1 inducing adjuvant as hereinbefore 
described. 

While the invention has been described with reference to certain BASB027 polypeptides 
and polynucleotides, it is to be understood that this covers fragments of the naturally 
occurring polypeptides and polynucleotides, and similar polypeptides and polynucleotides 
with additions, deletions or substitutions which do not substantially affect the 
immunogenic properties of the recombinant polypeptides or polynucleotides. 

Compositions, kits and administration 

In a further aspect of the invention there are provided compositions comprising a BASB027 
polynucleotide and/or a BASB027 polypeptide for administration to a cell or to a 
multicellular organism. 

The invention also relates to compositions comprising a polynucleotide and/or a 
polypeptide discussed herein or their agonists or antagonists. The polypeptides and 
polynucleotides of the invention may be employed in combination with a non-sterile or 
sterile carrier or carriers for use with cells, tissues or organisms, such as a pharmaceutical 
carrier suitable for administration to an individual. Such compositions comprise, for 
instance, a media additive or a therapeutically effective amount of a polypeptide and/or 
polynucleotide of the invention and a pharmaceutically acceptable carrier or excipient. Such 
carriers may include, but are not limited to, saline, buffered saline, dextrose, water, glycerol, 
ethanol and combinations thereof. The formulation should suit the mode of administration. 
The invention further relates to diagnostic and pharmaceutical packs and kits comprising 
one or more containers filled with one or more of the ingredients of the aforementioned 
compositions of the invention. 

Polypeptides, polynucleotides and other compounds of the invention may be employed 
alone or in conjunction with other compounds, such as therapeutic compounds. 

The pharmaceutical compositions may be administered in any effective, convenient manner 
including, for instance, administration by topical, oral, anal, vaginal, intravenous, 
intraperitoneal, intramuscular, subcutaneous, intranasal or intradermal routes among others. 

In therapy or as a prophylactic, the active agent may be administered to an individual as 
an injectable composition, for example as a sterile aqueous dispersion, preferably 
isotonic. 

In a further aspect, the present invention provides for pharmaceutical compositions 
comprising a therapeutically effective amount of a polypeptide and/or polynucleotide, such 
as the soluble form of a polypeptide and/or polynucleotide of the present invention, agonist 
or antagonist peptide or small molecule compound, in combination with a pharmaceutically 
acceptable carrier or excipient. Such carriers include, but are not limited to, saline, buffered 
saline, dextrose, water, glycerol, ethanol, and combinations thereof. The invention further 
relates to pharmaceutical packs and kits comprising one or more containers filled with one 
or more of the ingredients of the aforementioned compositions of the invention. 

Polypeptides, polynucleotides and other compounds of the present invention may be 
employed alone or in conjunction with other compounds, such as therapeutic compounds. 

The composition will be adapted to the route of administration, for instance by a systemic or 
an oral route. Preferred forms of systemic administration include injection, typically by 
intravenous injection. Other injection routes, such as subcutaneous, intramuscular, or 
intraperitoneal, can be used. Alternative means for systemic administration include 
transmucosal and transdermal administration using penetrants such as bile salts or fusidic 
acids or other detergents. In addition, if a polypeptide or other compounds of the present 
invention can be formulated in an enteric or an encapsulated formulation, oral 
administration may also be possible. Administration of these compounds may also be 
topical and/or localized, in the form of salves, pastes, gels, solutions, powders and the like. 

For administration to mammals, and particularly humans, it is expected that the daily 
dosage level of the active agent will be from 0.01 mg/kg to 10 mg/kg, typically around 1 
mg/kg. The physician in any event will determine the actual dosage which will be most 
suitable for an individual and will vary with the age, weight and response of the particular 
individual. The above dosages are exemplary of the average case. There can, of course, 
be individual instances where higher or lower dosage ranges are merited, and such are 
within the scope of this invention. 

The dosage range required depends on the choice of peptide, the route of administration, the 
nature of the formulation, the nature of the subject's condition, and the judgment of the 
attending practitioner. Suitable dosages, however, are in the range of 0.1-100 µg/kg of 
subject. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants may be 
employed to enhance the immune response. A suitable unit dose for vaccination is 0.5-5 
microgram/kg of antigen, and such dose is preferably administered 1-3 times and with an 
interval of 1-3 weeks. With the indicated dose range, no adverse toxicological effects will 
be observed with the compounds of the invention which would preclude their 
administration to suitable individuals. 

Wide variations in the needed dosage, however, are to be expected in view of the variety of 
compounds available and the differing efficiencies of various routes of administration. For 
example, oral administration would be expected to require higher dosages than 
administration by intravenous injection. Variations in these dosage levels can be adjusted 
using standard empirical routines for optimization, as is well understood in the art. 

Sequence Databases, Sequences in a Tangible Medium, and Algorithms 

Polynucleotide and polypeptide sequences form a valuable information resource with which 
to determine their 2- and 3-dimensional structures as well as to identify further sequences of 
similar homology. These approaches are most easily facilitated by storing the sequence in a 
computer readable medium and then using the stored data in a known macromolecular 
structure program or to search a sequence database using well known searching tools, such 
as the GCG program package. 

Also provided by the invention are methods for the analysis of character sequences or 
strings, particularly genetic sequences or encoded protein sequences. Preferred methods 
of sequence analysis include, for example, methods of sequence homology analysis, such 
as identity and similarity analysis, DNA, RNA and protein structure analysis, sequence 
assembly, cladistic analysis, sequence motif analysis, open reading frame determination, 
nucleic acid base calling, codon usage analysis, nucleic acid base trimming, and 
sequencing chromatogram peak analysis. 

A computer based method is provided for performing homology identification. This 
method comprises the steps of: providing a first polynucleotide sequence comprising the 
sequence of a polynucleotide of the invention in a computer readable medium; and 
comparing said first polynucleotide sequence to at least one second polynucleotide or 
polypeptide sequence to identify homology. 

A computer based method is also provided for performing homology identification, said 
method comprising the steps of: providing a first polypeptide sequence comprising the 
sequence of a polypeptide of the invention in a computer readable medium; and 
comparing said first polypeptide sequence to at least one second polynucleotide or 
polypeptide sequence to identify homology. 
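
The two steps recited above (providing a first sequence in a computer readable medium, then comparing it to at least one second sequence) can be illustrated with a minimal sketch. It is not the method of any particular program named in this specification. It assumes FASTA-formatted text as the stored form, uses a deliberately simple ungapped, equal-length identity measure in place of a full alignment, and the function names, the 0.8 threshold and the toy sequences are all hypothetical and are not SEQ ID NO:1 or SEQ ID NO:2.

```python
def parse_fasta(text):
    """Return {name: sequence} from FASTA-formatted text."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].strip()
            records[name] = []
        elif name is not None:
            records[name].append(line.upper())
    return {key: "".join(parts) for key, parts in records.items()}


def ungapped_identity(a, b):
    """Fraction of matching positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("an ungapped comparison needs equal-length sequences")
    if not a:
        raise ValueError("sequences must be non-empty")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches / len(a)


def find_homologs(query, database, threshold=0.8):
    """Names of database sequences whose identity to the query meets the threshold."""
    hits = []
    for name, seq in database.items():
        # Sequences of a different length are skipped in this simplified sketch;
        # a real search would align them first.
        if len(seq) == len(query) and ungapped_identity(query, seq) >= threshold:
            hits.append(name)
    return hits


# Hypothetical toy data standing in for a stored first sequence and a database.
FASTA = """>query
ACGTACGTAC
>hit_a
ACGTACGTAA
>hit_b
TTTTTTTTTT
"""

records = parse_fasta(FASTA)
query = records.pop("query")
print(find_homologs(query, records))
```

In practice the comparison step would be carried out with a gapped alignment or a database search tool such as those listed under "Identity" in the definitions; the ungapped measure is used here only to keep the sketch short.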

All publications and references, including but not limited to patents and patent 
applications, cited in this specification are herein incorporated by reference in their 
entirety as if each individual publication or reference were specifically and individually 
indicated to be incorporated by reference herein as being fully set forth. Any patent 
application to which this application claims priority is also incorporated by reference 
herein in its entirety in the manner described above for publications and references. 



DEFINITIONS 

"Identity," as known in the art, is a relationship between two or more polypeptide sequences 
or two or more polynucleotide sequences, as the case may be, as determined by comparing 
the sequences. In the art, "identity" also means the degree of sequence relatedness between 
polypeptide or polynucleotide sequences, as the case may be, as determined by the match 
between strings of such sequences. "Identity" can be readily calculated by known 
methods, including but not limited to those described in (Computational Molecular 
Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: 
Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., 
Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heijne, 
G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., 
eds., M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SIAM J. 
Applied Math., 48: 1073 (1988)). Methods to determine identity are designed to give the 
largest match between the sequences tested. Moreover, methods to determine identity are 
codified in publicly available computer programs. Computer program methods to 
determine identity between two sequences include, but are not limited to, the GAP 
program in the GCG program package (Devereux, J., et al., Nucleic Acids Research 12(1): 
387 (1984)), BLASTP, BLASTN (Altschul, S.F. et al., J. Molec. Biol. 215: 403-410 
(1990)), and FASTA (Pearson and Lipman, Proc. Natl. Acad. Sci. USA 85: 2444-2448 
(1988)). The BLAST family of programs is publicly available from NCBI and other 
sources (BLAST Manual, Altschul, S., et al., NCBI NLM NIH Bethesda, MD 20894; 
Altschul, S., et al., J. Mol. Biol. 215: 403-410 (1990)). The well known Smith Waterman 
algorithm may also be used to determine identity. 

Parameters for polypeptide sequence comparison include the following: 

Algorithm: Needleman and Wunsch, J. Mol. Biol. 48: 443-453 (1970) 

Comparison matrix: BLOSUM62 from Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 
89:10915-10919 (1992) 

Gap Penalty: 8 

Gap Length Penalty: 2 

A program useful with these parameters is publicly available as the "gap" program from 
Genetics Computer Group, Madison WI. The aforementioned parameters are the default 
parameters for peptide comparisons (along with no penalty for end gaps). 

Parameters for polynucleotide comparison include the following: 

Algorithm: Needleman and Wunsch, J. Mol. Biol. 48: 443-453 (1970) 

Comparison matrix: matches = +10, mismatch = 0 

Gap Penalty: 50 

Gap Length Penalty: 3 

Available as: The "gap" program from Genetics Computer Group, Madison WI. These 
are the default parameters for nucleic acid comparisons. 
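
To make the role of the four polynucleotide parameters concrete, the following is a minimal sketch of a Needleman and Wunsch style global alignment score with a separate gap penalty and gap length penalty. It is not the GCG "gap" program. It assumes that a gap of k positions costs the gap penalty plus k times the gap length penalty, and it penalises end gaps like any other gap. Both points are simplifying assumptions, and the function name is hypothetical.

```python
NEG_INF = float("-inf")


def global_alignment_score(a, b, match=10, mismatch=0, gap_open=50, gap_extend=3):
    """Best global alignment score of a and b under the assumed gap cost model."""
    n, m = len(a), len(b)
    # M: a[i-1] aligned to b[j-1]; X: a[i-1] aligned to a gap; Y: b[j-1] aligned to a gap.
    M = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    X = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    Y = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0
    for i in range(1, n + 1):
        X[i][0] = -(gap_open + gap_extend * i)
    for j in range(1, m + 1):
        Y[0][j] = -(gap_open + gap_extend * j)

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            M[i][j] = max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1]) + s
            # Opening a new gap costs gap_open + gap_extend; extending costs gap_extend.
            X[i][j] = max(M[i - 1][j] - gap_open - gap_extend,
                          X[i - 1][j] - gap_extend,
                          Y[i - 1][j] - gap_open - gap_extend)
            Y[i][j] = max(M[i][j - 1] - gap_open - gap_extend,
                          Y[i][j - 1] - gap_extend,
                          X[i][j - 1] - gap_open - gap_extend)
    return max(M[n][m], X[n][m], Y[n][m])


# Toy sequences only; these are not sequences of the invention.
print(global_alignment_score("ACGT", "ACGT"))  # four matches
print(global_alignment_score("ACGT", "AGGT"))  # three matches, one mismatch
print(global_alignment_score("ACGT", "ACT"))   # three matches, one single-position gap
```

With a match score of +10 and a gap costing at least 53 under the assumed model, a mismatch is always cheaper than opening a gap pair, which is why the second call aligns the sequences without gaps.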

Preferred meanings for "identity" for polynucleotides and polypeptides, as the case may 
be, are provided in (1) and (2) below. 

(1) Polynucleotide embodiments further include an isolated polynucleotide 
comprising a polynucleotide sequence having at least a 50, 60, 70, 80, 85, 90, 95, 97 or 
100% identity to the reference sequence of SEQ ID NO:1, wherein said polynucleotide 
sequence may be identical to the reference sequence of SEQ ID NO:1 or may include up 
to a certain integer number of nucleotide alterations as compared to the reference 
sequence, wherein said alterations are selected from the group consisting of at least one 
nucleotide deletion, substitution, including transition and transversion, or insertion, and 
wherein said alterations may occur at the 5' or 3' terminal positions of the reference 
nucleotide sequence or anywhere between those terminal positions, interspersed either 
individually among the nucleotides in the reference sequence or in one or more 
contiguous groups within the reference sequence, and wherein said number of nucleotide 
alterations is determined by multiplying the total number of nucleotides in SEQ ID NO:1 
by the integer defining the percent identity divided by 100 and then subtracting that 
product from said total number of nucleotides in SEQ ID NO:1, or: 

n_n ≤ x_n - (x_n • y), 

wherein n_n is the number of nucleotide alterations, x_n is the total number of nucleotides 
in SEQ ID NO:1, y is 0.50 for 50%, 0.60 for 60%, 0.70 for 70%, 0.80 for 80%, 0.85 for 
85%, 0.90 for 90%, 0.95 for 95%, 0.97 for 97% or 1.00 for 100%, and • is the symbol for 
the multiplication operator, and wherein any non-integer product of x_n and y is rounded 
down to the nearest integer prior to subtracting it from x_n. Alterations of a polynucleotide 
sequence encoding the polypeptide of SEQ ID NO:2 may create nonsense, missense or 
frameshift mutations in this coding sequence and thereby alter the polypeptide encoded by 
the polynucleotide following such alterations. 

By way of example, a polynucleotide sequence of the present invention may be identical 
to the reference sequence of SEQ ID NO:1, that is it may be 100% identical, or it may 
include up to a certain integer number of nucleic acid alterations as compared to the 
reference sequence such that the percent identity is less than 100% identity. Such 
alterations are selected from the group consisting of at least one nucleic acid deletion, 
substitution, including transition and transversion, or insertion, and wherein said 
alterations may occur at the 5' or 3' terminal positions of the reference polynucleotide 
sequence or anywhere between those terminal positions, interspersed either individually 
among the nucleic acids in the reference sequence or in one or more contiguous groups 
within the reference sequence. The number of nucleic acid alterations for a given percent 
identity is determined by multiplying the total number of nucleic acids in SEQ ID NO:1 
by the integer defining the percent identity divided by 100 and then subtracting that 
product from said total number of nucleic acids in SEQ ID NO:1, or: 

n_n ≤ x_n - (x_n • y), 

wherein n_n is the number of nucleic acid alterations, x_n is the total number of nucleic 
acids in SEQ ID NO:1, y is, for instance, 0.70 for 70%, 0.80 for 80%, 0.85 for 85% etc., • 
is the symbol for the multiplication operator, and wherein any non-integer product of x_n 
and y is rounded down to the nearest integer prior to subtracting it from x_n. 
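
The arithmetic described above can be sketched as follows. The function name is hypothetical, and the sequence lengths in the example are illustrative values only, since the actual length of SEQ ID NO:1 is not restated here. The percent identity is taken as an integer and the product is rounded down with integer division, which mirrors "the integer defining the percent identity divided by 100" and avoids floating-point rounding.

```python
def max_alterations(total_length, percent_identity):
    """Upper bound on alterations: x - floor(x * y), with y = percent_identity / 100."""
    if total_length < 0:
        raise ValueError("total_length must be non-negative")
    if not 0 <= percent_identity <= 100:
        raise ValueError("percent_identity must be between 0 and 100")
    # Integer division rounds the product x * y down to the nearest integer.
    rounded_product = (total_length * percent_identity) // 100
    return total_length - rounded_product


# Illustrative lengths only (hypothetical, not the length of SEQ ID NO:1).
print(max_alterations(1000, 95))   # 1000 - 950 = 50
print(max_alterations(333, 97))    # 333 * 0.97 = 323.01, rounded down to 323, so 10
print(max_alterations(1000, 100))  # 0
```

The same calculation applies to the polypeptide case in (2), with the total number of amino acids in the reference sequence in place of the nucleotide count.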

(2) Polypeptide embodiments further include an isolated polypeptide comprising a 
polypeptide having at least a 50, 60, 70, 80, 85, 90, 95, 97 or 100% identity to a 
polypeptide reference sequence of SEQ ID NO:2, wherein said polypeptide sequence may 
be identical to the reference sequence of SEQ ID NO:2 or may include up to a certain 
integer number of amino acid alterations as compared to the reference sequence, wherein 
said alterations are selected from the group consisting of at least one amino acid deletion, 
substitution, including conservative and non-conservative substitution, or insertion, and 
wherein said alterations may occur at the amino- or carboxy-terminal positions of the 
reference polypeptide sequence or anywhere between those terminal positions, 
interspersed either individually among the amino acids in the reference sequence or in one 
or more contiguous groups within the reference sequence, and wherein said number of 
amino acid alterations is determined by multiplying the total number of amino acids in 
SEQ ID NO:2 by the integer defining the percent identity divided by 100 and then 
subtracting that product from said total number of amino acids in SEQ ID NO:2, or: 

n_a ≤ x_a - (x_a • y), 

wherein n_a is the number of amino acid alterations, x_a is the total number of amino acids 
in SEQ ID NO:2, y is 0.50 for 50%, 0.60 for 60%, 0.70 for 70%, 0.80 for 80%, 0.85 for 
85%, 0.90 for 90%, 0.95 for 95%, 0.97 for 97% or 1.00 for 100%, and • is the symbol for 
the multiplication operator, and wherein any non-integer product of x_a and y is rounded 
down to the nearest integer prior to subtracting it from x_a. 

By way of example, a polypeptide sequence of the present invention may be identical to 
the reference sequence of SEQ ID NO:2, that is it may be 100% identical, or it may 
include up to a certain integer number of amino acid alterations as compared to the 
reference sequence such that the percent identity is less than 100% identity. Such 
alterations are selected from the group consisting of at least one amino acid deletion, 
substitution, including conservative and non-conservative substitution, or insertion, and 
wherein said alterations may occur at the amino- or carboxy-terminal positions of the 
reference polypeptide sequence or anywhere between those terminal positions, 
interspersed either individually among the amino acids in the reference sequence or in one 
or more contiguous groups within the reference sequence. The number of amino acid 
alterations for a given % identity is determined by multiplying the total number of amino 
acids in SEQ ID NO:2 by the integer defining the percent identity divided by 100 and 
then subtracting that product from said total number of amino acids in SEQ ID NO:2, or: 

n_a ≤ x_a - (x_a • y), 

wherein n_a is the number of amino acid alterations, x_a is the total number of amino acids 
in SEQ ID NO:2, y is, for instance, 0.70 for 70%, 0.80 for 80%, 0.85 for 85% etc., and • is 
the symbol for the multiplication operator, and wherein any non-integer product of x_a and 
y is rounded down to the nearest integer prior to subtracting it from x_a. 

"Individual(s)," when used herein with reference to an organism, means a multicellular 
eukaryote, including, but not limited to a metazoan, a mammal, an ovid, a bovid, a simian, 
a primate, and a human. 

"Isolated" means altered "by the hand of man" from its natural state, i.e., if it occurs in 
nature, it has been changed or removed from its original environment, or both. For example, 
a polynucleotide or a polypeptide naturally present in a living organism is not "isolated," but 
the same polynucleotide or polypeptide separated from the coexisting materials of its natural 
state is "isolated", as the term is employed herein. Moreover, a polynucleotide or 
polypeptide that is introduced into an organism by transformation, genetic manipulation or 
by any other recombinant method is "isolated" even if it is still present in said organism, 
which organism may be living or non-living. 

"Polynucleotide(s)" generally refers to any polyribonucleotide or polydeoxyribonucleotide, 
which may be unmodified RNA or DNA or modified RNA or DNA including single and 
double-stranded regions. 

"Variant" refers to a polynucleotide or polypeptide that differs from a reference 
polynucleotide or polypeptide, but retains essential properties. A typical variant of a 
polynucleotide differs in nucleotide sequence from another, reference polynucleotide.
Changes in the nucleotide sequence of the variant may or may not alter the amino acid 
sequence of a polypeptide encoded by the reference polynucleotide. Nucleotide changes 
may result in amino acid substitutions, additions, deletions, fusions and truncations in 
the polypeptide encoded by the reference sequence, as discussed below. A typical 
variant of a polypeptide differs in amino acid sequence from another, reference 
polypeptide. Generally, differences are limited so that the sequences of the reference 
polypeptide and the variant are closely similar overall and, in many regions, identical.
A variant and reference polypeptide may differ in amino acid sequence by one or more 
substitutions, additions, or deletions in any combination. A substituted or inserted amino
acid residue may or may not be one encoded by the genetic code. A variant of a
polynucleotide or polypeptide may be naturally occurring, such as an allelic variant, or
it may be a variant that is not known to occur naturally. Non-naturally occurring
variants of polynucleotides and polypeptides may be made by mutagenesis techniques 
or by direct synthesis. 

"Disease(s)" means any disease caused by or related to infection by a bacterium, including,
for example, otitis media in infants and children, pneumonia in the elderly, sinusitis,
nosocomial infections and invasive diseases, chronic otitis media with hearing loss, fluid
accumulation in the middle ear, auditory nerve damage, delayed speech learning, infection
of the upper respiratory tract and inflammation of the middle ear.

EXAMPLES:

The examples below are carried out using standard techniques, which are well known and 
routine to those of skill in the art, except where otherwise described in detail. The examples
are illustrative, but do not limit the invention. 

Example 1 : 

Discovery and confirmatory DNA sequencing of the BASB027 gene from 
Moraxella catarrhalis strain ATCC 43617. 

The BASB027 gene of SEQ ID NO:1 was first discovered in the Incyte PathoSeq database
containing unfinished genomic DNA sequences of the Moraxella catarrhalis strain
ATCC 43617 (also referred to as strain Mc2931). The translation of the BASB027
polynucleotide sequence, shown in SEQ ID NO:2, showed significant similarity (32 %
identity in an 817 amino acid overlap) to the OMP85 outer membrane protein of
Neisseria meningitidis. 

The sequence of the BASB027 gene was further confirmed experimentally. For this 
purpose, genomic DNA was extracted from 10^10 cells of M. catarrhalis (strain
ATCC 43617) using the QIAGEN genomic DNA extraction kit (Qiagen GmbH), and
1 µg of this material was submitted to Polymerase Chain Reaction DNA amplification
using primers E515515 (5'- ACT-ATA-GGG-CAC-GCG-TG -3') [SEQ ID NO:5] and
E515528 (5'- CCT-GCG-TTT-GTT-TGA-TTG-AG -3') [SEQ ID NO:6]. This PCR
product was purified on a Biorobot 9600 (Qiagen GmbH) apparatus and subjected to
DNA sequencing using the Big Dye Cycle Sequencing kit (Perkin-Elmer) and an ABI
377/PRISM DNA sequencer. DNA sequencing was performed on both strands with a
redundancy of 2 and the full length sequence was assembled using the SeqMan program 
from the DNASTAR Lasergene software package. The resulting DNA sequence and 
deduced polypeptide sequence are shown as SEQ ID NO:3 and SEQ ID NO:4 
respectively. Four nucleotide differences distinguish SEQ ID NO:3 from SEQ ID NO:1.
Using the MEGALIGN program from the DNASTAR Lasergene software package, an
alignment of the polynucleotide sequences of SEQ ID NO:1 and 3 was performed, and
is displayed in Figure 2; their level of identity was calculated to be 99.8 %.
Using the same program, an alignment of the polypeptide sequences of SEQ ID NO:2
and 4 was performed, and is displayed in Figure 3; their level of identity was calculated
to be 99.8 %.
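
To make the identity figures above concrete, the following is a minimal sketch of a percent-identity calculation over two pre-aligned sequences. It is an illustrative assumption, not the MEGALIGN algorithm, which may score gaps and alignment length differently; the 2500-position test sequences are hypothetical and merely show that four differences over a sequence of roughly that length round to 99.8 %.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned columns in which both sequences carry the same residue.

    Columns where both sequences have a gap are ignored; a gap opposite a
    residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Hypothetical 2500-nt sequences differing at four positions.
    reference = "A" * 2500
    variant = list(reference)
    for position in (10, 500, 1200, 2400):
        variant[position] = "C"
    print(round(percent_identity(reference, "".join(variant)), 1))
```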

Example 2: 

Variability analysis of the BASB027 gene among several Moraxella catarrhalis 
strains. 

2A: Restriction Fragment Length Analysis (RFLP). 

Genomic DNA was extracted from 16 M. catarrhalis strains (presented in Table 1) as
described below. M. catarrhalis was streaked for single colonies on BHI agar plates
and grown overnight at 37 °C. Three or four single colonies were picked and used to
inoculate a ~1.5 ml BHI (Brain-heart infusion) broth seed culture which was grown
overnight, in a shaking incubator, ~300 rpm, at 37 °C. A 500 ml Erlenmeyer flask
containing ~150 ml of BHI broth was inoculated with the seed culture and grown for
~12-16 hours at 37 °C in a shaking incubator, ~175 rpm, to generate cell mass for DNA
isolation. Cells were collected by centrifugation in a Sorvall GSA rotor at ~2000 x g
for 15 minutes at room temperature. The supernatant was removed and the cell pellet
suspended in ~5.0 ml of sterile water. An equal volume of lysis buffer (200 mM NaCl,
20 mM EDTA, 40 mM Tris-HCl, pH 8.0, 0.5% (w/v) SDS, 0.5% (v/v)
2-mercaptoethanol, and 250 µg/ml of proteinase K) was added and the cells suspended by
gentle agitation and trituration. The cell suspension was then incubated ~12 hours at
50°C to lyse the bacteria and liberate chromosomal DNA. Proteinaceous material was
precipitated by the addition of 5.0 ml of saturated NaCl (~6.0 M, in sterile water) and
centrifugation at ~5,500 x g in a Sorvall SS34 rotor at room temperature. Chromosomal
DNA was precipitated from the cleared supernatant by the addition of two volumes of
100 % ethanol. Aggregated DNA was collected and washed using gentle agitation in a
small volume of a 70 % ethanol solution. Purified chromosomal DNA was suspended
in sterile water and allowed to dissolve/disperse overnight at 4 °C by gentle rocking.
The concentration of dissolved DNA was determined spectrophotometrically at 260 nm
using an extinction coefficient of 1.0 O.D. unit = ~50 µg/ml.
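
The spectrophotometric conversion stated above amounts to a single multiplication. The sketch below assumes the conversion factor of about 50 µg/ml per O.D. unit given in the text; the function name, the dilution-factor parameter and the example reading are illustrative assumptions.

```python
def dna_concentration_ug_per_ml(
    a260: float,
    dilution_factor: float = 1.0,
    ug_per_ml_per_od: float = 50.0,
) -> float:
    """Estimate DNA concentration from absorbance at 260 nm.

    a260             -- measured absorbance of the (possibly diluted) sample
    dilution_factor  -- fold dilution applied before measurement (assumption)
    ug_per_ml_per_od -- conversion factor, ~50 ug/ml per O.D. unit per the text
    """
    if a260 < 0 or dilution_factor <= 0:
        raise ValueError("absorbance must be >= 0 and dilution factor > 0")
    return a260 * dilution_factor * ug_per_ml_per_od


if __name__ == "__main__":
    # Hypothetical reading: A260 = 0.25 on a 20-fold dilution.
    print(dna_concentration_ug_per_ml(0.25, dilution_factor=20))
```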

This material was next submitted to PCR amplification using the MC-D15-BamF (5'-
AAG GGC CCA ATT ACG CAG AGG GGA TCC ACA GGA CTA CAG CGA GTG
ACC ATT GAA AGC TTA C -3') [SEQ ID NO:7] and MC-D15-SalRC (5'- AAG GGC
CCA ATT ACG CAG AGG GTC GAC TTA TTA AAA GAC ACT ACC AAT CTG
GAA CTG TAC CGT ATC G -3') [SEQ ID NO:8] oligonucleotides. The corresponding
BASB027 gene amplicons were then independently subjected to hydrolysis using
restriction enzymes (AciI, HindIII, MaeIII, NlaIII, RsaI, Sau3AI) and restriction
products were separated by agarose or polyacrylamide gel electrophoresis using
standard molecular biology procedures as described in "Molecular Cloning, a
Laboratory Manual, Second Edition, Eds: Sambrook, Fritsch & Maniatis, Cold Spring
Harbor press 1989". The photographs of the resulting electrophoresis gels are displayed
in Figure 1. For each strain, RFLP patterns corresponding to the 6 restriction enzymes
were scored and combined. Groups of strains sharing identical combinations of RFLP
patterns were then defined. Using this methodology, the strains tested in this study fell
into 4 genomic groups (Group 1: Mc2906, Mc2908, Mc2912, Mc2926; Group 2:
Mc2905, Mc2907, Mc2909, Mc2911, Mc2913, Mc2960, Mc2975; Group 3: Mc2910,
Mc2912, Mc2956, Mc2969; Group 4: Mc2931). These data indicate that the Moraxella
catarrhalis population used in this study displays limited nucleotide sequence diversity
for the BASB027 gene.
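
The grouping step described above (score one pattern per enzyme, combine the six scores, then collect strains that share the same combination) can be sketched as follows. The strain names and pattern letters are hypothetical placeholders and do not reproduce the gel data of Figure 1; the enzyme names are as read from the list above.

```python
from __future__ import annotations

from collections import defaultdict

# The six restriction enzymes used for the RFLP scoring.
ENZYMES = ("AciI", "HindIII", "MaeIII", "NlaIII", "RsaI", "Sau3AI")


def group_by_combined_pattern(scores: dict[str, dict[str, str]]) -> list[list[str]]:
    """Group strains whose combined per-enzyme RFLP patterns are identical."""
    groups: dict[tuple[str, ...], list[str]] = defaultdict(list)
    for strain, per_enzyme in scores.items():
        # The combined pattern is the ordered tuple of the six per-enzyme scores.
        combined = tuple(per_enzyme[enzyme] for enzyme in ENZYMES)
        groups[combined].append(strain)
    # Largest groups first, ties broken alphabetically.
    return sorted(
        (sorted(members) for members in groups.values()),
        key=lambda members: (-len(members), members),
    )


# Hypothetical scores: one pattern letter per enzyme, in ENZYMES order.
EXAMPLE_SCORES = {
    "strain_1": dict(zip(ENZYMES, "AAAAAA")),
    "strain_2": dict(zip(ENZYMES, "AAAAAA")),
    "strain_3": dict(zip(ENZYMES, "ABAAAA")),
    "strain_4": dict(zip(ENZYMES, "ABAABA")),
}

if __name__ == "__main__":
    for number, members in enumerate(group_by_combined_pattern(EXAMPLE_SCORES), start=1):
        print(f"Group {number}: {', '.join(members)}")
```
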

Table 1: Features of the Moraxella catarrhalis strains used in this study

Strain               Isolated in:   from:
Mc2904               USA            Tympanocentesis
Mc2905               USA            Tympanocentesis
Mc2906               USA            Tympanocentesis
Mc2907               USA            Tympanocentesis
Mc2908               USA            Acute otitis Tympanocentesis
Mc2909               USA            Tympanocentesis
Mc2910               USA            Tympanocentesis
Mc2911               USA            Acute otitis Tympanocentesis
Mc2912               USA            Acute otitis Tympanocentesis
Mc2913               USA            Acute otitis Tympanocentesis
Mc2926               USA            Tympanocentesis
Mc2931/ATCC 43617    USA            Transtracheal aspirate
Mc2956               Finland        Middle ear fluid
Mc2960               Finland        Middle ear fluid
Mc2969               Norway         Nasopharynx (Pharyngitis-Rhinitis)
Mc2975               Norway         Nasopharynx (Rhinitis)

Example 3: Construction of Plasmid to Express Recombinant BASB027

A: Cloning of BASB027.

The BamHI and SalI restriction sites engineered into the forward ([SEQ ID NO:7]) and
reverse complementary ([SEQ ID NO:8]) amplification primers, respectively, permitted
directional cloning of an ~2500 bp PCR product into the commercially available E. coli
expression plasmid pQE30 (QiaGen, ampicillin resistant) such that a mature BASB027
protein could be expressed as a fusion protein containing a (His)6 affinity
chromatography tag at the N-terminus. The BASB027 PCR product was purified from
the amplification reaction using silica gel-based spin columns (QiaGen) according to the
manufacturer's instructions. To produce the required BamHI and SalI termini necessary
for cloning, purified PCR product was sequentially digested to completion with BamHI
and SalI restriction enzymes as recommended by the manufacturer (Life Technologies).
Following the first restriction digestion, the PCR product was purified via spin column
as above to remove salts and eluted in sterile water prior to the second enzyme
digestion. The digested DNA fragment was again purified using silica gel-based spin
columns prior to ligation with the pQE30 plasmid.
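
As a quick check on the engineered restriction sites mentioned above, the sketch below searches the two amplification primers for the BamHI (GGATCC) and SalI (GTCGAC) recognition sequences. The primer strings are transcribed from SEQ ID NO:7 and SEQ ID NO:8 as given in Example 2, with spacing removed; the function and variable names are illustrative assumptions.

```python
# Recognition sequences of the two enzymes used for directional cloning.
RECOGNITION_SITES = {"BamHI": "GGATCC", "SalI": "GTCGAC"}

# Primers as listed in Example 2 (5' to 3'), whitespace removed below.
FORWARD_PRIMER = (
    "AAG GGC CCA ATT ACG CAG AGG GGA TCC ACA GGA CTA CAG CGA GTG "
    "ACC ATT GAA AGC TTA C"
)
REVERSE_PRIMER = (
    "AAG GGC CCA ATT ACG CAG AGG GTC GAC TTA TTA AAA GAC ACT ACC AAT CTG "
    "GAA CTG TAC CGT ATC G"
)


def find_site(primer: str, enzyme: str) -> int:
    """Return the 0-based index of the enzyme's site in the primer, or -1."""
    sequence = "".join(primer.split()).upper()
    return sequence.find(RECOGNITION_SITES[enzyme])


if __name__ == "__main__":
    print("BamHI site in forward primer at index", find_site(FORWARD_PRIMER, "BamHI"))
    print("SalI site in reverse primer at index", find_site(REVERSE_PRIMER, "SalI"))
```

Both sites sit behind the same 21-nucleotide 5' extension, which gives each enzyme flanking sequence to bind when cutting close to the end of the PCR product.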

B: Production of Expression Vector. 

To prepare the expression plasmid pQE30 for ligation, it was similarly digested to
completion with both BamHI and SalI and then treated with calf intestinal phosphatase
(CIP, ~0.02 units / pmole of 5' end, Life Technologies) as directed by the manufacturer
to prevent self ligation. An approximately 5-fold molar excess of the digested fragment
to the prepared vector was used to program the ligation reaction. A standard ~20 µl
ligation reaction (~16°C, ~16 hours), using methods well known in the art, was
performed using T4 DNA ligase (~2.0 units / reaction, Life Technologies). An aliquot
of the ligation (~5 µl) was used to transform electro-competent M15(pREP4) cells
according to methods well known in the art. Following a ~2-3 hour outgrowth period at
37°C in ~1.0 ml of LB broth, transformed cells were plated on LB agar plates
containing kanamycin (50 µg/ml) and ampicillin (100 µg/ml). Both antibiotics were
included in the selection media to ensure that all transformed cells carried both the
pREP4 plasmid (KnR), which carries the lacIq gene necessary for repression, and hence
IPTG-inducible expression, of proteins encoded on pQE30, and the pQE30-BASB027
plasmid (ApR). Plates were incubated overnight at 37°C for ~16 hours.
Individual KnR / ApR colonies were picked with sterile toothpicks and used to "patch"
inoculate fresh LB Kn / Ap plates as well as a ~1.0 ml LB Kn / Ap broth culture.
Both the patch plates and the broth culture were incubated overnight at 37°C in either a
standard incubator (plates) or a shaking water bath.
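
The approximately 5-fold molar excess of insert over vector used for the ligation above can be converted into a mass of insert with the usual length-ratio relation. This is a minimal sketch: the vector length of about 3461 bp for pQE30 and the 50 ng vector amount are assumptions for illustration and are not stated in this specification.

```python
def insert_mass_ng(
    vector_ng: float,
    vector_bp: int,
    insert_bp: int,
    molar_ratio: float = 5.0,
) -> float:
    """Mass of insert (ng) giving `molar_ratio` moles of insert per mole of vector.

    For double-stranded DNA, moles are proportional to mass / length, so
    insert_ng = vector_ng * molar_ratio * insert_bp / vector_bp.
    """
    if vector_bp <= 0 or insert_bp <= 0:
        raise ValueError("fragment lengths must be positive")
    return vector_ng * molar_ratio * insert_bp / vector_bp


if __name__ == "__main__":
    # Assumed values: 50 ng of a ~3461 bp vector and a ~2500 bp insert.
    print(round(insert_mass_ng(50, 3461, 2500), 1), "ng of insert")
```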

A whole cell-based PCR analysis was employed to verify that transformants contained
the BASB027 DNA insert. Here, the ~1.0 ml overnight LB Kn / Ap broth culture was
transferred to a 1.5 ml polypropylene tube and the cells collected by centrifugation in a
Beckman microcentrifuge (~3 min., room temperature, ~12,000 x g). The cell pellet
was suspended in ~200 µl of sterile water and a ~10 µl aliquot used to program a ~50 µl
final volume PCR reaction containing both BASB027 forward and reverse amplification
primers. Final concentrations of the PCR reaction components were essentially the
same as those specified in Example 2 except ~5.0 units of Taq polymerase was used.
The initial 95 °C denaturation step was increased to 3 minutes to ensure thermal
disruption of the bacterial cells and liberation of plasmid DNA. An ABI Model 9700
thermal cycler and a 32 cycle, three-step thermal amplification profile, i.e. 95°C, 45 sec;
55-58°C, 45 sec; 72°C, 1 min, were used to amplify the BASB027 PCR fragment from
the lysed transformant samples. Following thermal amplification, a ~20 µl aliquot of the
reaction was analyzed by agarose gel electrophoresis (0.8 % agarose in a Tris-acetate-
EDTA (TAE) buffer). DNA fragments were visualized by UV illumination after gel
electrophoresis and ethidium bromide staining. A DNA molecular size standard (1 Kb
ladder, Life Technologies) was electrophoresed in parallel with the test samples and was
used to estimate the size of the PCR products. Transformants that produced the
expected ~2500 bp PCR product were identified as strains containing a BASB027
expression construct. Expression plasmid-containing strains were then analyzed for the
inducible expression of recombinant BASB027.

C: Expression Analysis of PCR-Positive Transformants. 

For each PCR-positive transformant identified above, ~5.0 ml of LB broth containing
kanamycin (50 µg/ml) and ampicillin (100 µg/ml) was inoculated with cells from the
patch plate and grown overnight at 37 °C with shaking (~250 rpm). An aliquot of the
overnight seed culture (~1.0 ml) was inoculated into a 125 ml Erlenmeyer flask
containing ~25 ml of LB Kn / Ap broth and grown at 37 °C with shaking (~250 rpm)
until the culture turbidity reached O.D.600 of ~0.5, i.e. mid-log phase (usually about 1.5
- 2.0 hours). At this time approximately half of the culture (~12.5 ml) was transferred to
a second 125 ml flask and expression of recombinant BASB027 protein induced by the
addition of IPTG (1.0 M stock prepared in sterile water, Sigma) to a final concentration
of 1.0 mM. Incubation of both the IPTG-induced and non-induced cultures continued
for an additional ~4 hours at 37 °C with shaking. Samples (~1.0 ml) of both induced
and non-induced cultures were removed after the induction period and the cells
collected by centrifugation in a microcentrifuge at room temperature for ~3 minutes.
Individual cell pellets were suspended in ~50 µl of sterile water, then mixed with an
equal volume of 2X Laemmli SDS-PAGE sample buffer containing 2-mercaptoethanol,
and placed in a boiling water bath for ~3 min to denature protein. Equal volumes (~15 µl)
of both the crude IPTG-induced and the non-induced cell lysates were loaded onto
duplicate 12% Tris/glycine polyacrylamide gels (1 mm thick Mini-gels, Novex). The
induced and non-induced lysate samples were electrophoresed together with prestained
molecular weight markers (SeeBlue, Novex) under conventional conditions using a
standard SDS/Tris/glycine running buffer (BioRad). Following electrophoresis, one gel
was stained with Coomassie brilliant blue R250 (BioRad) and then destained to
visualize novel BASB027 IPTG-inducible protein(s). The second gel was electroblotted
onto a PVDF membrane (0.45 micron pore size, Novex) for ~2 hrs at 4 °C using a
BioRad Mini-Protean II blotting apparatus and Towbin's methanol (20 %) transfer
buffer. Blocking of the membrane and antibody incubations were performed according
to methods well known in the art. A monoclonal anti-RGS (His)3 antibody, followed
by a second rabbit anti-mouse antibody conjugated to HRP (QiaGen), was used to
confirm the expression and identity of the BASB027 recombinant protein.
Visualization of the anti-His antibody reactive pattern was achieved using either an
ABT insoluble substrate or using Hyperfilm with the Amersham ECL
chemiluminescence system.
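
For the induction step described above, the volume of IPTG stock needed follows from the usual dilution relation C1 x V1 = C2 x V2. The sketch below is illustrative only: it neglects the small volume added by the stock itself, and the function name is an assumption.

```python
import math


def stock_volume_ul(stock_mM: float, final_mM: float, culture_ml: float) -> float:
    """Volume of stock (in microlitres) to reach `final_mM` in `culture_ml` of culture.

    Uses C1 * V1 = C2 * V2 and ignores the volume contributed by the stock.
    """
    if stock_mM <= 0 or final_mM < 0 or culture_ml < 0:
        raise ValueError("concentrations and volume must be non-negative, stock > 0")
    return final_mM * culture_ml * 1000.0 / stock_mM


if __name__ == "__main__":
    # 1.0 M (1000 mM) IPTG stock, 1.0 mM final, ~12.5 ml induced culture.
    volume = stock_volume_ul(stock_mM=1000.0, final_mM=1.0, culture_ml=12.5)
    assert math.isclose(volume, 12.5)
    print(f"Add about {volume:.1f} microlitres of 1.0 M IPTG")
```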

D: Sequence Confirmation.

To further verify that the IPTG-inducible recombinant BASB027 protein being 
expressed is in the correct open reading frame and not a spurious molecule arising from 
a cloning artifact (i.e. a frame-shift), the DNA sequence of the cloned insert was 
determined. The DNA sequence for the M. catarrhalis BASB027 gene was obtained
from one strand using conventional asymmetric PCR cycle sequencing methodologies
(ABI Prism Dye-Terminator Cycle Sequencing, Perkin-Elmer). Sequencing reactions
were programmed with undigested expression plasmid DNA (~0.5 µg/rxn) as a template
and appropriate pQE30 vector-specific and ORF-specific sequencing primers (~3.5
pmol/rxn). In addition to the template and sequencing primer, each sequencing reaction
(~20 µl) contained the four different dNTPs (i.e. A, G, C and T) and the four
corresponding ddNTPs (i.e. ddA, ddG, ddC, and ddT) terminator nucleotides, with each
terminator being conjugated to one of the four fluorescent dyes, Joe, Tam, Rox, or Fam.
Single strand sequencing elongation products were terminated at random positions
along the template by the incorporation of the dye-labelled ddNTP terminators.
Fluorescent dye-labelled termination products were purified using microcentrifuge size-
exclusion chromatography columns (Princeton Genetics), dried under vacuum,
suspended in a Template Resuspension Buffer (Perkin-Elmer) for capillary
electrophoresis or deionized formamide for PAGE, denatured at 95°C for ~5 min, and
analyzed by high resolution capillary electrophoresis (ABI 310 Automated DNA 
Sequenator, Perkin-Elmer) or high resolution PAGE (ABI 377 Automated DNA
Sequenator) as recommended by the manufacturer. DNA sequence data produced from
individual reactions were collected and the relative fluorescent peak intensities analyzed 
automatically on a PowerMAC computer using ABI Sequence Analysis Software 
(Perkin-Elmer). Individually autoanalyzed DNA sequences were edited manually for 
accuracy before being merged into a consensus single strand sequence "string" using 
AutoAssembler software (Perkin-Elmer). Sequencing determined that the expression 
plasmid contained the correct sequence in the correct open reading frame.

Example 4 : Production of Recombinant BASB027 

Bacterial strain 

A recombinant expression strain of E. coli M15 (pREP4) containing a plasmid (pQE30)
encoding BASB027 from M. catarrhalis was used to produce cell mass for purification
of recombinant protein. The expression strain was cultivated on LB agar plates
containing 50 µg/ml kanamycin ("Kn") and 100 µg/ml ampicillin ("Ap") to ensure that
the pREP4 lacIq control plasmid and the pQE30-BASB027 expression construct were
both maintained. For cryopreservation at -80 °C, the strain was propagated in LB broth
containing the same concentration of antibiotics, then mixed with an equal volume of LB
broth containing 30% (w/v) glycerol.

Media 

The fermentation medium used for the production of recombinant protein consisted of 
2X YT broth (Difco) containing 50 µg/ml Kn and 100 µg/ml Ap. Antifoam was added to
medium for the fermentor at 0.25 ml/L (Antifoam 204, Sigma). To induce expression of
the BASB027 recombinant protein, IPTG (Isopropyl β-D-Thiogalactopyranoside) was
added to the fermentor (1 mM, final).

Fermentation 

A 500-ml Erlenmeyer seed flask, containing 50 ml working volume, was inoculated with
0.3 ml of rapidly thawed frozen culture, or several colonies from a selective agar plate
culture, and incubated for approximately 12 hours at 37 ± 1 °C on a shaking platform at
150 rpm (Innova 2100, New Brunswick Scientific). This seed culture was then used to
inoculate a 5-L working volume fermentor containing 2X YT broth and both Kn and Ap
antibiotics. The fermentor (Bioflo 3000, New Brunswick Scientific) was operated at 37
± 1 °C, 0.2 - 0.4 VVM air sparge, 250 rpm with Rushton impellers. The pH was not
controlled in either the flask seed culture or the fermentor. During fermentation, the pH
ranged from 6.5 to 7.3 in the fermentor. IPTG (1.0 M stock, prepared in sterile water) was
added to the fermentor when the culture reached mid-log of growth (~0.7 O.D.600
units). Cells were induced for 2 - 4 hours then harvested by centrifugation using either a
28RS Heraeus (Sepatech) or RC5C superspeed centrifuge (Sorvall Instruments). Cell
paste was stored at -20 °C until processed.

Purification 

Chemicals and Materials 

Imidazole, guanidine hydrochloride, Tris (hydroxymethyl), and EDTA (ethylenediamine
tetraacetic acid), biotechnology grade or better, were all obtained from
Amresco Chemical, Solon, Ohio. Triton X-100 (t-Octylphenoxypolyethoxyethanol),
sodium phosphate, monobasic, and Urea were reagent grade or better and obtained from
Sigma Chemical Company, St. Louis, Missouri. Glacial acetic acid and hydrochloric
acid were obtained from Mallinckrodt Baker Inc., Phillipsburg, New Jersey. Methanol
was obtained from Fisher Scientific, Fairlawn, New Jersey. Pefabloc® SC
(4-(2-Aminoethyl)-benzenesulfonyl fluoride), Complete protease inhibitor cocktail tablets, and
PMSF (phenylmethylsulfonyl fluoride) were obtained from Roche Diagnostics
Corporation, Indianapolis, Indiana. Bestatin, Pepstatin A, and E-64 protease inhibitor
were obtained from Calbiochem, La Jolla, California. Dulbecco's Phosphate Buffered
Saline (1x PBS) was obtained from Quality Biological, Inc., Gaithersburg, Maryland.
Dulbecco's Phosphate Buffered Saline (10x PBS) was obtained from BioWhittaker,
Walkersville, Maryland. Penta-His Antibody, BSA free, was obtained from QiaGen,
Valencia, California. Peroxidase-conjugated AffiniPure Goat Anti-mouse IgG was
obtained from Jackson Immuno Research, West Grove, Penn. AEC single solution was
obtained from Zymed, South San Francisco, California. All other chemicals were
reagent grade or better.

Ni-NTA Superflow resin was obtained from QiaGen Inc., Valencia, California. Precast
Tris-Glycine 4-20% and 10-20% polyacrylamide gels, all running buffers and solutions,
SeeBlue Pre-Stained Standards, MultiMark Multi-Colored Standards and PVDF transfer
membranes were obtained from Novex, San Diego, California. SDS-PAGE Silver Stain
kits were obtained from Daiichi Pure Chemicals Company Limited, Tokyo, Japan.
Coomassie Stain Solution was obtained from Bio-Rad Laboratories, Hercules,
California. Acrodisc® PF 0.2 µm syringe filters were obtained from Pall Gelman
Sciences, Ann Arbor, Michigan. GD/X 25 mm disposable syringe filters were obtained
from Whatman Inc., Clifton, New Jersey. Dialysis tubing 8,000 MWCO was obtained
from BioDesign Inc. of New York, Carmel, New York. BCA Protein Assay Reagents
and Snake Skin dialysis tubing 3,500 MWCO were obtained from Pierce Chemical Co.,
Rockford, Illinois.

Extraction Protocol 

Cell paste was thawed at room temperature for 30 to 60 minutes. Five to six grams of
material was weighed out into a 50 ml disposable centrifuge tube. To this, five
mls/gram of Guanidine hydrochloride (Gu-HCl) buffer was added (6 M Guanidine
hydrochloride, 100 mM Sodium phosphate, monobasic, 10 mM Tris and 0.05 % Triton
X-100, pH 8.0). Cell paste was resuspended using a PRO300D Pro Scientific
homogenizer at 3/4 power for one minute. The extraction mixture was then placed at
room temperature with gentle agitation for 60 to 90 minutes. After 60 to 90 minutes the
extraction mixture was centrifuged at 15,800 x g for 15 minutes (Sorvall RC5C
centrifuge, 11,500 rpm). The supernatant (S1) was decanted and saved for additional
purification. The pellet (P1) was saved for analysis.
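
The centrifugation conditions above quote both a g-force and a rotor speed; the two are linked by the standard relation RCF = 1.118 x 10^-5 x r x rpm^2, with r the rotor radius in cm. The following sketch checks that the two figures are mutually consistent. The radius of 10.7 cm is an assumption made for illustration, since the rotor is not named for this step.

```python
def relative_centrifugal_force(rpm: float, radius_cm: float) -> float:
    """Relative centrifugal force (x g) for a rotor speed and radius in cm."""
    if rpm < 0 or radius_cm < 0:
        raise ValueError("rpm and radius must be non-negative")
    return 1.118e-5 * radius_cm * rpm ** 2


if __name__ == "__main__":
    assumed_radius_cm = 10.7  # assumption: not stated in the protocol
    rcf = relative_centrifugal_force(11_500, assumed_radius_cm)
    print(f"11,500 rpm at r = {assumed_radius_cm} cm is about {rcf:,.0f} x g")
```

With that assumed radius the result lands close to the 15,800 x g stated in the protocol.
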

Binding of BASB027 to Nickel-NTA Resin

To the S1, three to four mls of Ni-NTA resin is added. This is then placed at room
temperature with gentle agitation for one hour. After one hour the S1/Ni-NTA is
packed into an XK16 Pharmacia column. The column is then washed with 1 M Gu-HCl
buffer (1 M Guanidine hydrochloride, 100 mM Sodium phosphate, monobasic, 10 mM
Tris and 0.05% Triton X-100, pH 8.0). This is then followed by a wash with phosphate
buffer (100 mM Sodium phosphate, monobasic, 10 mM Tris and 0.05% Triton X-100,
pH 6.3). The protein is then eluted from the column with a 250 mM imidazole buffer
(250 mM imidazole, 100 mM Sodium phosphate, monobasic, 10 mM Tris and 0.05%
Triton X-100, pH 5.9).

Final Formulation 

BASB027 was formulated by dialysis overnight against three changes of 0.1 % Triton
X-100 and 1x PBS, pH 7.4, to remove residual Gu-HCl and imidazole. The purified
protein was characterized and used to produce antibodies as described below. 

Biochemical Characterizations 
SDS-PAGE and Western Blot Analysis 

The recombinant purified protein was resolved on 4-20 % polyacrylamide gels and 
electrophoretically transferred to PVDF membranes at 100 V for 1 hour as previously 
described (Towbin et al. 1979, Proc. Natl. Acad. Sci. USA 76:4350-4354). The
PVDF membranes were then pretreated with 25 ml of Dulbecco's phosphate buffered 
saline containing 5 % non-fat dry milk. All subsequent incubations were carried out 
using this pretreatment buffer. 

PVDF membranes were incubated with 25 ml of a 1:500 dilution of preimmune serum
or rabbit anti-His immune serum for 1 hour at room temperature. PVDF membranes
were then washed twice with wash buffer (20 mM Tris buffer, pH 7.5, containing 150
mM sodium chloride and 0.05 % Tween-20). PVDF membranes were incubated with
25 ml of a 1:5000 dilution of peroxidase-labeled goat anti-rabbit IgG (Jackson
ImmunoResearch Laboratories, West Grove, PA) for 30 minutes at room temperature.
PVDF membranes were then washed 4 times with wash buffer, and were developed
with 3-amino-9-ethylcarbazole and urea peroxide as supplied by Zymed (San Francisco,
CA) for 10 minutes each.

The results of an SDS-PAGE (Figure 4) show a protein of about 95 kDa that is reactive with
an anti-RGS(His) antibody in western blots (Figure 5) of the SDS-PAGE.

Protein Sequencing 

Amino terminal amino acid sequencing of the purified protein was performed to 
confirm the production of the correct recombinant protein using well defined chemical 
protocols on a Hewlett-Packard model G1000A sequencer with a model 1090 LC and a
Hewlett-Packard model 241 sequencer with a model 1100 LC.

Example 5 : Production of Antisera to Recombinant BASB027 

Polyvalent antisera directed against the BASB027 protein were generated by 
vaccinating two rabbits with the purified recombinant BASB027 protein. Each animal
was given a total of three immunizations intramuscularly (i.m.) of about 20 µg BASB027
protein per injection (beginning with complete Freund's adjuvant and followed with
incomplete Freund's adjuvant) at approximately 21 day intervals. Animals were bled
prior to the first immunization ("pre-bleed") and on days 35 and 57.

Anti-BASB027 protein titres were measured by an ELISA using purified recombinant
BASB027 protein (0.5 µg/well). The titre is defined as the highest dilution giving a value
equal to or greater than 0.1 when calculated with the following equation: average OD of two
test samples of antisera - the average OD of two test samples of buffer.
The antisera were used as the first antibody to identify the protein in a western blot as
described in Example 4 above. The western blot shows the presence of anti-BASB027
antibody in the sera of immunized animals (Figure 6).
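
The titre definition above can be expressed as a short calculation: for each dilution, subtract the mean buffer OD from the mean of the duplicate antiserum ODs, and report the highest dilution whose difference is at least 0.1. The sketch below is illustrative; the OD readings are hypothetical, and dilutions are keyed by their reciprocal (16000 stands for 1:16000).

```python
from statistics import mean


def elisa_titre(duplicate_ods, buffer_ods, cutoff=0.1):
    """Highest reciprocal dilution whose blank-corrected mean OD is >= cutoff.

    duplicate_ods -- mapping of reciprocal dilution to the two antiserum ODs
    buffer_ods    -- the two buffer-only ODs
    Returns None when no dilution reaches the cutoff.
    """
    blank = mean(buffer_ods)
    positive = [
        dilution
        for dilution, ods in duplicate_ods.items()
        if mean(ods) - blank >= cutoff
    ]
    return max(positive) if positive else None


# Hypothetical duplicate readings for a four-step dilution series.
EXAMPLE_READINGS = {
    1000: (1.20, 1.10),
    4000: (0.50, 0.54),
    16000: (0.20, 0.22),
    64000: (0.10, 0.12),
}

if __name__ == "__main__":
    print(elisa_titre(EXAMPLE_READINGS, buffer_ods=(0.05, 0.07)))
```

The sketch takes the definition literally and does not check that the signal falls monotonically with dilution.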

Example 6 : Immunological Characterisation 

Western Blot Analysis 

Several strains of M. catarrhalis were grown on chocolate agar plates for 48 hours at
35°C in 5% CO2. Several colonies were used to inoculate 25 ml of Mueller Hinton broth
in a 250 ml flask. Cultures were grown overnight and collected by centrifugation. Cells
were then solubilized by suspending 30 µg of cells in 150 µl of PAGE sample buffer
(360 mM Tris buffer, pH 8.8, containing 4% sodium dodecylsulfate and 20% glycerol),
and incubating the suspension at 100°C for 5 minutes. The solubilized cells were
resolved on 4-20% polyacrylamide gels and the separated proteins were
electrophoretically transferred to PVDF membranes at 100 V for 1 hour as previously
described (Towbin et al. 1979, Proc. Natl. Acad. Sci. USA 76:4350-4354). The
PVDF membranes were then pretreated with 25 ml of Dulbecco's phosphate buffered 
saline containing 5 % non-fat dry milk. All subsequent incubations were carried out 
using this pretreatment buffer. 

PVDF membranes were incubated with 25 ml of a 1:500 dilution of preimmune serum or
rabbit immune serum for 1 hour at room temperature. PVDF membranes were then
washed twice with wash buffer (20 mM Tris buffer, pH 7.5, containing 150 mM sodium
chloride and 0.05% Tween-20). PVDF membranes were incubated with 25 ml of a
1:5000 dilution of peroxidase-labeled goat anti-rabbit IgG (Jackson ImmunoResearch
Laboratories, West Grove, PA) for 30 minutes at room temperature. PVDF membranes
were then washed 4 times with wash buffer, and were developed with
3-amino-9-ethylcarbazole and urea peroxide as supplied by Zymed (San Francisco, CA) for 10
minutes each.

A protein of about 95 kDa (corresponding to the expected molecular weight of BASB027) that
is reactive with the antisera is detected in all Moraxella strains (Figure 7).

Bactericidal Activity 

Complement-mediated cytotoxic activity of anti-BASB027 antibodies was examined to 
determine the vaccine potential of BASB027 polypeptide. Antiserum was prepared as 
described above. The activities of the pre-immune serum and the anti-BASB027
antiserum in mediating complement killing of M. catarrhalis were examined using the
"Serum Bactericidal Test" described by Zollinger et al. (Immune Responses to Neisseria
meningitidis, in Manual of Clinical Laboratory Immunology, 3rd ed., pg 347-349), except
that cells of M. catarrhalis strains or cultivars were used instead of Neisseria meningitidis
cells.

The bactericidal titer of rabbit antiserum (50% killing of homologous strain) was <1:8 
(pre-immune) and >1:128 (immune). 
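
The way a 50% killing titer can be read off a dilution series may be sketched as follows. This is only an illustration under stated assumptions, not the Zollinger procedure itself: colony counts are compared with a control without test serum, and the titer is taken as the highest reciprocal dilution that still gives at least 50% killing. The function names and all colony counts are hypothetical.

```python
def percent_killing(cfu_with_serum: float, cfu_control: float) -> float:
    """Percent of bacteria killed relative to a control without test serum."""
    if cfu_control <= 0:
        raise ValueError("control count must be positive")
    return 100.0 * (1.0 - cfu_with_serum / cfu_control)


def bactericidal_titer(cfu_by_dilution: dict, cfu_control: float,
                       threshold: float = 50.0):
    """Highest reciprocal dilution giving >= threshold percent killing, else None."""
    passing = [dilution for dilution, cfu in cfu_by_dilution.items()
               if percent_killing(cfu, cfu_control) >= threshold]
    return max(passing) if passing else None


if __name__ == "__main__":
    # Hypothetical colony counts keyed by reciprocal serum dilution.
    immune = {8: 5, 16: 10, 32: 20, 64: 35, 128: 45, 256: 80}
    pre_immune = {8: 90, 16: 95, 32: 100}
    print(bactericidal_titer(immune, cfu_control=100))
    print(bactericidal_titer(pre_immune, cfu_control=100))
```

With these made-up counts the immune serum would be scored 1:128, and the pre-immune serum returns `None`, which would be reported as below the lowest dilution tested (<1:8).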

Example 7 : Presence of Antibody to BASB027 in Human Convalescent Sera 

Western blot analysis of purified recombinant BASB027 was performed as described 
in Examples 4 and 6 above, except that a pool of human sera from children infected by 
M. catarrhalis was used as the first antibody preparation. Results show that antisera 
from naturally infected individuals react with the purified recombinant protein. 

Example 8 : Production of BASB027 peptides, Antisera and Reactivity Thereof 

Two short BASB027-specific peptides, having the amino acid sequences 
CYAKPLNKKQNDQTDT (SEQ ID NO:9) and YLTARRGQQTTLGEVVC (SEQ ID 
NO:10), were produced in the laboratory using generally well known methods. These 
peptides, coupled to KLH, were used to produce antibodies in 12-week-old Specific 
Pathogen Free New Zealand female rabbits. Rabbits received 4 injections at 
approximately 3-week intervals of 200 µg of peptide-KLH in complete (1st injection) or 
incomplete (2nd, 3rd and 4th injections) Freund's adjuvant. Animals were bled prior to the 
first immunization and one month after the 4th injection. 

Anti-peptide mid-point titres were measured by ELISA using free peptides. Anti-peptide 
mid-point titres one month after the 4th immunization were greater than 1 5000. 
Western blots of purified recombinant BASB027, using anti-peptide antibodies as the 
first antibody, were prepared as described in Examples 4 and 6. The results are presented 
in Figure 8. 
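
The text does not define how the mid-point titre was derived from the ELISA readings. One common convention, assumed here purely for illustration, is the reciprocal dilution at which the absorbance falls to half of the maximum observed, found by linear interpolation on a log10 dilution scale. The function name `midpoint_titre` and the readings are hypothetical.

```python
import math


def midpoint_titre(reciprocal_dilutions, absorbances):
    """Reciprocal dilution at half-maximal absorbance, or None if never crossed.

    Assumption: the curve is interpolated linearly between neighbouring
    points on a log10(dilution) axis.
    """
    half = max(absorbances) / 2.0
    points = sorted(zip(reciprocal_dilutions, absorbances))
    for (d1, a1), (d2, a2) in zip(points, points[1:]):
        if a1 >= half >= a2 and a1 != a2:
            frac = (a1 - half) / (a1 - a2)
            log_d = math.log10(d1) + frac * (math.log10(d2) - math.log10(d1))
            return 10 ** log_d
    return None


if __name__ == "__main__":
    # Hypothetical absorbance readings for a ten-fold dilution series.
    print(midpoint_titre([1000, 10000, 100000], [2.0, 1.0, 0.0]))
```

Other curve-fitting conventions (for example a four-parameter logistic fit) would give somewhat different values for the same readings.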
Deposited materials 

A deposit containing a Moraxella catarrhalis Catlin strain has been deposited with the American 
Type Culture Collection (herein "ATCC") on June 21, 1997 and assigned deposit number 43617. 
The deposit was described as Branhamella catarrhalis (Frosch and Kolle) and is a freeze-dried, 
1.5-2.9 kb insert library constructed from an M. catarrhalis isolate obtained from a transtracheal aspirate of 
a coal miner with chronic bronchitis. The deposit is described in Antimicrob. Agents Chemother. 
21: 506-508 (1982). 

The Moraxella catarrhalis strain deposit is referred to herein as "the deposited strain" or as "the 
DNA of the deposited strain." 

The deposited strain contains a full length BASB027 gene. 

A deposit of the vector pMC-DI5 consisting of Moraxella catarrhalis DNA inserted in pQE30 has 
been deposited with the American Type Culture Collection (ATCC) on February 12, 1999 and 
assigned deposit number 207105. 

The sequence of the polynucleotides contained in the deposited strain / clone, as well as the amino 
acid sequence of any polypeptide encoded thereby, are controlling in the event of any conflict with 
any description of sequences herein. 

The deposits of the deposited strains have been made under the terms of the Budapest Treaty on the 
International Recognition of the Deposit of Micro-organisms for Purposes of Patent Procedure. The 
deposited strains will be irrevocably and without restriction or condition released to the public upon 
the issuance of a patent. The deposited strains are provided merely as a convenience to those of skill 
in the art and are not an admission that a deposit is required for enablement, such as that required 
under 35 U.S.C. §112. 
INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
OR OTHER BIOLOGICAL MATERIAL 

(PCT Rule 13bis) 

A. The indications made below relate to the deposited microorganism or other biological material referred to in the description 
on page 65, lines 1-28 

B. IDENTIFICATION OF DEPOSIT 

Further deposits are identified on an additional sheet 

Name of depositary institution: AMERICAN TYPE CULTURE COLLECTION 

Address of depositary institution (including postal code and country): 
10801 UNIVERSITY BLVD, MANASSAS, VIRGINIA 20110-2209, 
UNITED STATES OF AMERICA 

Date of deposit: 21 June 1997 (21.06.97) & 12 February 1999 (12.02.99) 

Accession Number: 43617 & 207105 

C. ADDITIONAL INDICATIONS (leave blank if not applicable) 

In respect of those designations where a European Patent is sought, a sample 
of the deposited microorganism will be made available until the publication 
of the mention of the grant of the European Patent or until the date on which 
the application has been refused or withdrawn, only by issue of such a sample 
to an expert nominated by the person requesting the sample. 

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession 
Number of Deposit") 

For receiving Office use only 
This sheet was received with the international application 
Authorized officer: R.LR. Pether 

For International Bureau use only 
This sheet was received by the International Bureau on: 
Authorized officer 

Form PCT/RO/134 
CLAIMS: 

1. An isolated polypeptide comprising an amino acid sequence which has at least 85% 
identity to the amino acid sequence selected from the group consisting of: SEQ ID NO:2, 
SEQ ID NO:4. 

2. An isolated polypeptide as claimed in claim 1 in which the amino acid sequence has at 
least 95% identity to the amino acid sequence selected from the group consisting of: SEQ 
ID NO:2, SEQ ID NO:4. 

3. The polypeptide as claimed in claim 1 comprising the amino acid sequence selected 
from the group consisting of: SEQ ID NO:2, SEQ ID NO:4. 

4. An isolated polypeptide of SEQ ID NO:2 or SEQ ID NO:4. 

5. An immunogenic fragment of the polypeptide as claimed in any one of claims 1 to 4 in 
which the immunogenic activity of said immunogenic fragment is substantially the same 
as the polypeptide of SEQ ID NO:2, SEQ ID NO:4. 

6. An isolated polynucleotide comprising a nucleotide sequence encoding a polypeptide 
that has at least 85% identity to the amino acid sequence of SEQ ID NO:2, 4 over the entire 
length of SEQ ID NO:2, 4 respectively; or a nucleotide sequence complementary to said 
isolated polynucleotide. 

7. An isolated polynucleotide comprising a nucleotide sequence that has at least 85% 
identity to a nucleotide sequence encoding a polypeptide of SEQ ID NO:2, 4 over the entire 
coding region; or a nucleotide sequence complementary to said isolated polynucleotide. 

8. An isolated polynucleotide which comprises a nucleotide sequence which has at least 
85% identity to that of SEQ ID NO:1, 3 over the entire length of SEQ ID NO:1, 3 
respectively; or a nucleotide sequence complementary to said isolated polynucleotide. 

9. The isolated polynucleotide as claimed in any one of claims 6 to 8 in which the 
identity is at least 95% to SEQ ID NO:1, 3. 

10. An isolated polynucleotide comprising a nucleotide sequence encoding the polypeptide 
of SEQ ID NO:2, SEQ ID NO:4. 

11. An isolated polynucleotide comprising the polynucleotide of SEQ ID NO:1, SEQ ID 
NO:3. 

12. An isolated polynucleotide comprising a nucleotide sequence encoding the polypeptide 
of SEQ ID NO:2, SEQ ID NO:4 obtainable by screening an appropriate library under 
stringent hybridization conditions with a labeled probe having the sequence of SEQ ID 
NO:1, SEQ ID NO:3 or a fragment thereof. 

13. An expression vector or a recombinant live microorganism comprising an isolated 
polynucleotide according to any one of claims 6-12. 

14. A host cell comprising the expression vector of claim 13 or a subcellular fraction or a 
membrane of said host cell expressing an isolated polypeptide comprising an amino acid 
sequence that has at least 85% identity to the amino acid sequence selected from the group 
consisting of: SEQ ID NO:2, SEQ ID NO:4. 

15. A process for producing a polypeptide comprising an amino acid sequence that has at 
least 85% identity to the amino acid sequence selected from the group consisting of: SEQ 
ID NO:2, SEQ ID NO:4 comprising culturing a host cell of claim 14 under conditions 
sufficient for the production of said polypeptide and recovering the polypeptide from the 
culture medium. 

16. A process for expressing a polynucleotide of any one of claims 6-12 comprising 
transforming a host cell with the expression vector comprising at least one of said 
polynucleotides and culturing said host cell under conditions sufficient for expression of 
any one of said polynucleotides. 

17. A vaccine composition comprising an effective amount of the polypeptide of any 
one of claims 1 to 5 and a pharmaceutically acceptable carrier. 

18. A vaccine composition comprising an effective amount of the polynucleotide of any 
one of claims 6 to 12 and a pharmaceutically effective carrier. 

19. The vaccine composition according to either one of claims 17 or 18 wherein said 
composition comprises at least one other Moraxella catarrhalis antigen. 

20. An antibody immunospecific for the polypeptide or immunological fragment as 
claimed in any one of claims 1 to 5. 

21. A method of diagnosing a Moraxella catarrhalis infection, comprising identifying a 
polypeptide as claimed in any one of claims 1 - 5, or an antibody that is immunospecific 
for said polypeptide, present within a biological sample from an animal suspected of 
having such an infection. 

22. Use of a composition comprising an immunologically effective amount of a 
polypeptide as claimed in any one of claims 1 - 5 in the preparation of a medicament for 
use in generating an immune response in an animal. 

23. Use of a composition comprising an immunologically effective amount of a 
polynucleotide as claimed in any one of claims 6-12 in the preparation of a medicament 
for use in generating an immune response in an animal. 

24. A therapeutic composition useful in treating humans with Moraxella catarrhalis 
disease comprising at least one antibody directed against the polypeptide of claims 1 - 5 
and a suitable pharmaceutical carrier. 
Figure 2 : Alignment of the BASB027 polynucleotide sequences. 
Identity to SEQ ID NO:1 is indicated by a dot. 

2C * 40 

Seqidi : A T G C G T AA T T C AT AT T T T AAAG GTTTTCAGGTCAGTGC AAT G AC AA T G G C : 50 

5eqid3 : : 5C 

60 80 100 
Seqidi : T G T C AT G AT G G T AA7 G T C AAC T C AT G C AC AAG C G G C G G AT T 7 7 A T G G C A»A : 100 
Seqid3 : : 100 

120 * 140 
Seqidi : ATGACATTACCATCACAGGACTACAGCGAGTGACCATTGAAAGCTTACAA : 150 
Seqid3 : G : 150 

160 * 180 * 200 
Seqidi : AGCGTGCTGCCGTTTCGCTTGGGTCAAGTGGTGAGCGAAAACCAGTTGGC : 200 
Seqid3 : GCA : 200 

220 * 240 
Seqidi : TGATGGTGTCAAAGCACTTTATGCAACAGGCAATTTTTCAGATGTGCAAG : 2 50 
Seqid3 : : 250 

260 * 280 + 300 
Seqidi : TCTATCATCAAGAAGGGCGTATCATCTATCAGGTAACCGAAAGGCCGTTA : 300 
Seqid3 : : 300 

320 * 340 
Seqidi : ATCGCTGAGATTAAT7TTGAGGGCAATCGCTTAATTCCAAAAGAAGGTCT : 350 
Seqic3 : : 350 
360 * 380 * 400 

Seqicl : AC AA G AA.G G G C T AAAAAAT G C T G G C T T AG C T G T G G G T C AAC C AC T AAAA C : 4 CO 

SeqicO : : 4C0 

420 * 440 

Seqicl : AA G C C A C A G T A C A G A T G AT C G AAA C C G AG C T T A C C AA T C AA T A TAT AT C A : 450 

Sec:c3 : : 450 

4 60 * 4 30 * 500 

Seqidl : C -AAG G C T A T T A T AAT AC C G AAAT T AC T G T C AAAC AG A C GAT GCTTGATGG : 500 

SeqidO : : 500 

520 * 540 

Seqidl : T AAT C G T G T T AAG C T T G AT AT G ACC T T T G C T G AAG G T AAAC CTGCACGGG : 550 

Seqic3 : : 550 

560 * 58C * 600 

Seqidl : TGGTTGATATTAATATCATTGGCAATCAGCATTTTAGCGATGCAGATTTG : 600 

SeqidS : : 600 

620 - 640 

Seqidl : ATTGATGTGCTTGCGATTAAGGATAATAAAATCAATCCACTGTCTAAAGC : 650 

Seqid3 : : 650 

660 * 680 * 700 

Seqidl : TGACCGTTATACTCAAGAAAAGCTGGTGACCAGTTTAGAGAATTTGCGTG : 7 00 

Seqid3 : : 700 

720 * 740 

Seqidl : CTAAATATCTCAATGCAGGGTTTGTGCGTTTTGAGATTAAAGATGCTAAG : 7 50 
Seqid3 : : 750 

7 60 * 780 * 8 00 
Secicl : C T 7 A_AT A T T AA T G AA G A T AAAAA C C G T A T C T T T G T T G AG AT T 7 C A T T G C A : 3 00 
Seqid3 : : 800 

S20 840 
Seqidi : T G AAG G T G AG C AAT A T C G C T T T G G AC AG AC AC AG TTTTTGGGT A-A T T T AA. : 8 50 
Seqic3 : : 850 

360 - 880 - 900 
SeqidI : CTTATACTCAAGCAGAACTTGAGGCACTGCTTAAATTCAAAGCAGAAGAA : 900 
SeqidB : : 900 

920 * 940 
SeqidI : GGGTTTTCACAAGCCATGCTTGAGCAAACAACAAACAATATCAGTACCAA : 950 
Seqid3 : : 950 

960 * 980 * 1000 

SeqidI : ATTTGGTGACGATGGCTATTATTATGCTCAAATCCGTCCTGTAACACGCA : 1000 

Seqid3 : : 1000 

1020 + 1040 

SeqidI : TTAATGATGAAAGTCGTACGGTTGATGTGGAATATTATATTGACCCTGtA : 1050 

Seqid3 : : 1050 

1060 - 1080 * 1100 

Secidl : CACCCTGTCTATGTACGCCGTATTAATTTTACAGGTAACTTTAAGACCCA : 1100 

Seqid3 : : 1100 
1120 * 1140 

Seqidl : AG AT G AA.G T A C T C C G T C G T G AG AT G C G AC AA.C T T G AA.G G 7 GC G 7 T G G C A T : 1150 

Seqici2 : : 115C 

1160 - 1180 ' 1200 

Seqidl : C T AA T C AAAAAA T C C A G C T G T C T C G T G C AC G C T 7 G A.7 G C G G AC T G G G T T T : 12CC 

Seqid3 : : 120G 

1220 * 1240 

Seqidl : T 7 T AAA.C AT GTTACCGTT G A TACTCGTCCAGT AC C C AA. C T C AC C T G AT C A : 1250 

Seqid3 : : 1250 

1260 * 1280 * 1300 

Seqidl : GGTTGATGTAAAT7TTGTGGTTGAAGAACAACCTTCAGGATCATCAACCA : 1300 

Seqid3 : : 1300 

1320 * 1340 

Seqidl : TCGCAGCAGGCTACTCTCAAAGTGGTGGTGTAACTTTTCAATTTGATGTT : 1350 

Seqid3 : : 1350 

1360 * 138C * 1400 

Seqidl : TCTCAAAATAACTTTATGGGTACAGGTAAGCACGTCAATGCTTCGTTTTC : 14 00 

Seqid3 : : 1400 

1420 * 1440 * 

Seqidl : TCGCTCTGAGACCCGTGAGGTGTATAGTTTGGGTATGACCAACCCATACT : 14 50 

Seqid3 : : 1450 

1460 * 1480 * 1500 

Seqidl : TTACCGTAAATGGCGTCTCGCAAAGCTTGAGTGGCTACTATCGTAAAACC : 1500 

Seqid3 : : 1500 
1520 * 1540 

Seqidl : AAG TAT G AT AAC AAG AAC AT T AG T AAT T AT G T AC T T G A T T C T T A T G G T G G : 1 550 

Seqid3 : : 1550 

1560 * 15S0 * 160C 

Seqici : C T C AT T AA.G C T A T G G AT .AT C C A-AT T G AT G A.AAAT C AAC G C AT A.AG C T T T G : 1 600 

Seqid2 : : 1600 

1620 - 1640 

Seqidl : GTCTGAA.TGCTGACAATACCAAGCTTCATGGCGGTCGTTTTATGGGCATT : 1650 

Seqid: : : 1650 

1660 * 1680 + 1700 

Seqidl : AGTAATGTCAAGCAGCTGATGGCAGATGGTGGCAAAATTCAAGTGGATA.A : 1700 

Seqid3 : : 1700 

1720 * 1740 

Seqidl : TAATGGCATTCCTGATTTTAAGCATGATTACACAACCTACAATGCCATTT : 17 50 

Seqid3 : : 1750 

1760 * 1780 * 1800 

Seqidl : TGGGGTGGAATTATTCAAGTCTAGATCGCCCTGTATTTCCAACCCAAGGC : 18 00 

Seqid3 : : 1800 

1820 * 1840 

Seqidl : ATGAGTCATTCTGTAGATTTGACGGTTGGTTTTGGTGATAAAACTCATCA : 18 50 

Seqid3 : : 185C 



Seqicil 



1860 * 1880 * 1900 

AAAAGTGGTTTATCAAGGCAATATCTATCGCCCATTTATCAAAAAATCAG 



1900 
Seqid3 : : 1900 

1920 - 1940 

Seqidl : T C 7 T G C G 7 G G A 7 A "7 C AAG 7 7 AG G 2 7 A.7G G C AAT AA 7 7 7 A 3 C A 7 7 7 7 A 7 : 1950 

Seqicl : : 1950 

1961 * 193C ' 2000 

Seqidl : GAAAA777C7A7GCAGGCGGC7A7GG77CGG77CG7GGC7A7GA7CAA7C : 2000 

Seqic3 : : 2O0C 

2020 - 2040 

Seqidl : C7C777GGG7CCACGC7CACAAGCC7A777GACAGC7CG7CG7GG7CAAC : 20 50 

Seqid3 : : 2050 

2060 - 2080 - 210C 

Seqidl : AAAC C AC AC 7AGGAGAGG7TGTTGG7G G 7 AAT G C T 7 T G G C AAC T 7 7 C G G C : 2100 

Seqid3 : : 2100 

2120 + 2140 

Seqicl : AG7GAGC7GA7777ACC777GCCA777AAAGG7GA77GGA7AGA7CAGG7 : 215 0 

Seqid3 : : 2150 

2160 * 2180 * 2200 

Seqidl : GCG7CCAG7GA7A77GA77GAGGGCGG 7 CAGGT7777GA7ACA-ACAGG7A : 2200 

Seqid3 : : 2200 

2220 * 224C 

Seqidl : 7 G G A7 AAAC AAAC C A.T 7 G A.7 T7 AAC C C AA7 7 7 AAAG ACC C AC AAGC AAC A : 22 5 0 

Seqid3 : : 2250 
2260 * 2280 * 2300 
SeqidI : -2 2 7 G A_A C AAAA.7 G C AiAAAG C AG C C A_A T C G C C 2 2 C T AC T AA C C C AAG A T AA : 22CC 
Secic2 : : 2302 



232C ' 2340 

SeqidI : A 2 AG T T G C G T 7 AT AG T GC T GG T G T 7 G G T G C AAC T T G G T AT ACG C C C A7 TG : 2 35 2 

Seqidj : : 2250 

2360 ~ 2330 * 2400 

SeqidI : G T C C T 7 7 A 7 C T A T T A G C T A T G C C AAG C C A T T G AA T AAAAAA.C AAAAT G A T : 2 4 00 

Seqid3 : : 2400 



SeqidI 
Seqid3 



2420 - 2440 
CAGACCGATACGGTACAGTTCCAGATTGGTAGTGTCTTTTAA : 24 4 2 
: 2442 
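
The dot convention used in the alignment figures, and the percent identity figures recited in the claims, can be illustrated with a short Python sketch. It assumes two sequences that are already aligned, of equal length and without gaps; the identity definition given elsewhere in the specification may rely on a dedicated alignment program, so this is only a simplified illustration. The function names and the two toy sequences are hypothetical.

```python
def dot_alignment(reference: str, other: str) -> str:
    """Render `other` against `reference`, replacing identical positions by dots."""
    if len(reference) != len(other):
        raise ValueError("sequences must be pre-aligned to the same length")
    return "".join("." if a == b else b for a, b in zip(reference, other))


def percent_identity(reference: str, other: str) -> float:
    """Percent of positions identical over the entire length of the reference."""
    if len(reference) != len(other) or not reference:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(reference, other))
    return 100.0 * matches / len(reference)


if __name__ == "__main__":
    ref = "ATGCGTAATT"    # toy reference sequence
    other = "ATGCGCAATT"  # toy variant with one substitution
    print(dot_alignment(ref, other))
    print(percent_identity(ref, other))
```

For the toy pair above, the rendering is `.....C....` and the identity is 90%.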
Figure 3 : Alignment of the BASB027 polypeptide sequences. 
Identity to SEQ ID NO:2 is indicated by a dot. 

                           20         *        40
Seqid2 : MRNSYFKGFQVSAMTMAVMMVMSTHAQAADFMANDITITGLQRVTIESLQ :  50
Seqid4 : ....................................A............. :  50

                 60         *        80         *       100
Seqid2 : SVLPFRLGQVVSENQLADGVKALYATGNFSDVQVYHQEGRIIYQVTERPL : 100
Seqid4 : .............A.................................... : 100

                          120         *       140
Seqid2 : IAEINFEGNRLIPKEGLQEGLKNAGLAVGQPLKQATVQMIETELTNQYIS : 150
Seqid4 : .................................................. : 150

                160         *       180         *       200
Seqid2 : QGYYNTEITVKQTMLDGNRVKLDMTFAEGKPARVVDINIIGNQHFSDADL : 200
Seqid4 : .................................................. : 200

                          220         *       240
Seqid2 : IDVLAIKDNKINPLSKADRYTQEKLVTSLENLRAKYLNAGFVRFEIKDAK : 250
Seqid4 : .................................................. : 250

                260         *       280         *       300
Seqid2 : LNINEDKNRIFVEISLHEGEQYRFGQTQFLGNLTYTQAELEALLKFKAEE : 300
Seqid4 : .................................................. : 300

                          320         *       340
Seqid2 : GFSQAMLEQTTNNISTKFGDDGYYYAQIRPVTRINDESRTVDVEYYIDPV : 350
Seqid4 : .................................................. : 350

                360         *       380         *       400
Seqid2 : HPVYVRRINFTGNFKTQDEVLRREMRQLEGALASNQKIQLSRARLMRTGF : 400
Seqid4 : .................................................. : 400

                          420         *       440
Seqid2 : FKHVTVDTRPVPNSPDQVDVNFVVEEQPSGSSTIAAGYSQSGGVTFQFDV : 450
Seqid4 : .................................................. : 450

                460         *       480         *       500
Seqid2 : SQNNFMGTGKHVNASFSRSETREVYSLGMTNPYFTVNGVSQSLSGYYRKT : 500
Seqid4 : .................................................. : 500

                          520         *       540
Seqid2 : KYDNKNISNYVLDSYGGSLSYGYPIDENQRISFGLNADNTKLHGGRFMGI : 550
Seqid4 : .................................................. : 550

                560         *       580         *       600
Seqid2 : SNVKQLMADGGKIQVDNNGIPDFKHDYTTYNAILGWNYSSLDRPVFPTQG : 600
Seqid4 : .................................................. : 600

                          620         *       640
Seqid2 : MSHSVDLTVGFGDKTHQKVVYQGNIYRPFIKKSVLRGYAKLGYGNNLPFY : 650
Seqid4 : .................................................. : 650

                660         *       680         *       700
Seqid2 : ENFYAGGYGSVRGYDQSSLGPRSQAYLTARRGQQTTLGEVVGGNALATFG : 700
Seqid4 : .................................................. : 700

                          720         *       740
Seqid2 : SELILPLPFKGDWIDQVRPVIFIEGGQVFDTTGMDKQTIDLTQFKDPQAT : 750
Seqid4 : .................................................. : 750

                760         *       780         *       800
Seqid2 : AEQNAKAANRPLLTQDKQLRYSAGVGATWYTPIGPLSISYAKPLNKKQND : 800
Seqid4 : .................................................. : 800

Seqid2 : QTDTVQFQIGSVF                                      : 813
Seqid4 : .............                                      : 813
Figure 4: Coomassie stained SDS-PAGE of purified BASB027 protein. 

Figure 5: Western blot with tetra-His antibody of purified BASB027 protein. 

Figure 6: Western blot of purified BASB027 protein with the corresponding 
anti-recombinant protein sera. Panel A: pre-immune serum. Panel B: immune serum. 

Figure 7: Western blot of whole lysates of 16 strains of M. catarrhalis using 
pooled sera against the BASB027 protein (sera were diluted 1:2000). 

Figure 8: Western blot of purified recombinant BASB027 with corresponding 
anti-peptide sera. Lanes 2 and 3: non-immune sera. Lanes 1 and 4: immune sera. 
SEQUENCE LISTING 

<110> SmithKline Beecham Biologicals 
<120> Novel Compounds 

<130> BM45324 
<160> 10 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 2442 
<212> DNA 
<213> Bacteria 

<400> 1 

atgcgtaacc catacttcaa aggttctcag gtcagtgcaa tgacaatggc tgtca-gatg 60 

gtaacgtcaa ctcatgcaca agcggcggat tttatggcaa acgacatcac caccacagga 120 

ctacagcgag tgaccattga aagcctacaa agcgtgccgc cgtttcgctt gggtcaagcg 180 

gtgagcgaaa accagttggc tgatggtgtc aaagcacttt atgcaacagg caatctctca 240 

gatgcgcaag tctatcatca agaagggcgt accatctatc aggnaaccga aaggccgcta 300 

accgctgaga ttaattctga gggcaaccgc ctaattccaa aagaaggtct acaagaaggg 360 

ccaaaaaatg ctggcttagc tgtgggtcaa ccaccaaaac aagccacagt acagacgatc 420 

gaaaccgagc tcaccaatca acatatanca caaggctatc acaataccga aattaccgcc 480 

aaacagacga tgcttgatgg taatcgtgtc aagcttgata tgacctttgc tgaaggtaaa 540 

cctgcacggg tggttgatat taacatcact ggcaatcagc attttagcga tgcagattcg 600 

attgacgcgc ttgcgattaa ggataataaa atcaatccac tgtctaaagc tgaccgttat 660 

ac:caagaaa agctggtgac cagtttagag aatttgcgtg ctaaatatct caacgcaggg 720 

tttgtgcgtt ttgagattaa agatgctaag cttaatatta atgaagataa aaaccgtatc 780 

tttgttgaga tttcattgca tgaaggtgag caatatcgct ttggacagac acagtttttg 840 

ggcaatttaa cttatactca agcagaactt gaggcactgc ttaaattcaa agcagaagaa 900 

gggt Lttcac aagccatgct tgagcaaaca acaaacaata tcagtaccaa atttggtgac 960 

gatggctatt attatgctca aatccgccct gtaacacgca ttaatgatga aagccgtacg 1020 

gtcgatgtgg aatattatat tgaccctgta caccctgcct atgtacgccg tattaacttt 1080 

acaggtaact ttaagaccca agatgaagta ctccgtcgtg agacgcgaca acttgaaggt 1140 

gcgtnggcat ctaatcaaaa aanccagctg tctcgtgcac gcctgatgcg gactgggntt 1200 

tttaaacatg ttaccgttga tactcgtcca gtacccaact cacctgacca ggttgatgta 1260 

aattttgtgg ttgaagaaca accctcagga tcatcaacca tcgcagcagg ctactctcaa 1320 

a 9t93tggtg taacctttca atttgatgtt tctcaaaata actttatggg tacaggtaag 1380 

cacgccaatg cttcgttttc tcgctctgag acccgtgagg tgtatagttt gggtatgacc 1440 

aacccacact ttaccgtaaa tggcgtctcg caaagcttga gtggctacca tcgcaaaacc' 1500 

aagtatgata acaagaacat tagtaatta: gnacttgatt cttatggtgg cccattaagc 1560 

tatggatatc caattgatga aaatcaacgc ataagctttg gtctgaatgc tgacaatacc 1620 

aagcttcatg gcggtcgttt tatgggcacc agtaatgtca agcagctgat ggcagatggt 1680 

ggcaaaattc aagtggataa taatggcatt cccgatttta agcatgatta cacaacctac 1740 

aatgccattt tggggtggaa ttactcaagt ccagatcgcc ctgtatttcc aacccaaggc 1800 

acgagccatt ctgtagattt gacggttggt tttggtgata aaactcatca aaaagtggtt 1860 

tatcaaggca atatctatcg cccattcacc aaaaaatcag tcttgcgtgg atacgccaag 1920 

titaggctatg gcaacaattt accattttat gaaaatttct atgcaggcgg ctacggttcg 1980 

gtccgcggcc atgaccaacc ctctttgggt ccacgctcac aagcctattt gacagctcgc 2040 

cgtggtcaac aaaccacact aggagaggtt: gt tggtggta t atgct tcggc aactttcggc 2100 

agcgagctga cttcaccttt gccatttaaa ggtgattgga tagatcaggt gcgtccagtg 2160 

atattcattg agggcggcca ggcttttga- acaacaggta tggataaaca aaccattgac 2220 

ccaacccaat ttaaagaccc acaagcaaca gctgaacaaa atgcaaaagc agccaaccgc 2280 

ccgctaccaa cccaagataa acagtcgcgt cacagtgctg gtgttggtgc aacttggtat 2340 
acgcccattg gtcctttatc tattagctat gccaagccac cgaataaaaa acaaaatgat 2400 
cagaccgata cggtacagtt ccagattggt agtgtctttt aa 2442 

<210> 2 

<211> 813 

<212> PRT 

<213> Bacteria 

<400> 2 

Met Arg Asn Ser Tyr Phe Lys Gly Phe Gln Val Ser Ala Met Thr Met
1               5                   10                  15

Ala Val Met Met Val Met Ser Thr His Ala Gln Ala Ala Asp Phe Met
            20                  25                  30

Ala Asn Asp Ile Thr Ile Thr Gly Leu Gln Arg Val Thr Ile Glu Ser
        35                  40                  45

Leu Gln Ser Val Leu Pro Phe Arg Leu Gly Gln Val Val Ser Glu Asn
    50                  55                  60

Gln Leu Ala Asp Gly Val Lys Ala Leu Tyr Ala Thr Gly Asn Phe Ser
65                  70                  75                  80

Asp Val Gln Val Tyr His Gln Glu Gly Arg Ile Ile Tyr Gln Val Thr
                85                  90                  95

Glu Arg Pro Leu Ile Ala Glu Ile Asn Phe Glu Gly Asn Arg Leu Ile
            100                 105                 110

Pro Lys Glu Gly Leu Gln Glu Gly Leu Lys Asn Ala Gly Leu Ala Val
        115                 120                 125

Gly Gln Pro Leu Lys Gln Ala Thr Val Gln Met Ile Glu Thr Glu Leu
    130                 135                 140

Thr Asn Gln Tyr Ile Ser Gln Gly Tyr Tyr Asn Thr Glu Ile Thr Val
145                 150                 155                 160

Lys Gln Thr Met Leu Asp Gly Asn Arg Val Lys Leu Asp Met Thr Phe
                165                 170                 175

Ala Glu Gly Lys Pro Ala Arg Val Val Asp Ile Asn Ile Ile Gly Asn
            180                 185                 190

Gln His Phe Ser Asp Ala Asp Leu Ile Asp Val Leu Ala Ile Lys Asp
        195                 200                 205

Asn Lys Ile Asn Pro Leu Ser Lys Ala Asp Arg Tyr Thr Gln Glu Lys
    210                 215                 220

Leu Val Thr Ser Leu Glu Asn Leu Arg Ala Lys Tyr Leu Asn Ala Gly
225                 230                 235                 240

Phe Val Arg Phe Glu Ile Lys Asp Ala Lys Leu Asn Ile Asn Glu Asp
                245                 250                 255

Lys Asn Arg Ile Phe Val Glu Ile Ser Leu His Glu Gly Glu Gln Tyr
            260                 265                 270

Arg Phe Gly Gln Thr Gln Phe Leu Gly Asn Leu Thr Tyr Thr Gln Ala
        275                 280                 285

Glu Leu Glu Ala Leu Leu Lys Phe Lys Ala Glu Glu Gly Phe Ser Gln
    290                 295                 300

Ala Met Leu Glu Gln Thr Thr Asn Asn Ile Ser Thr Lys Phe Gly Asp
305                 310                 315                 320

Asp Gly Tyr Tyr Tyr Ala Gln Ile Arg Pro Val Thr Arg Ile Asn Asp
                325                 330                 335

Glu Ser Arg Thr Val Asp Val Glu Tyr Tyr Ile Asp Pro Val His Pro
            340                 345                 350

Val Tyr Val Arg Arg Ile Asn Phe Thr Gly Asn Phe Lys Thr Gln Asp
        355                 360                 365

Glu Val Leu Arg Arg Glu Met Arg Gln Leu Glu Gly Ala Leu Ala Ser
    370                 375                 380

Asn Gln Lys Ile Gln Leu Ser Arg Ala Arg Leu Met Arg Thr Gly Phe
385                 390                 395                 400

Phe Lys His Val Thr Val Asp Thr Arg Pro Val Pro Asn Ser Pro Asp
                405                 410                 415

Gln Val Asp Val Asn Phe Val Val Glu Glu Gln Pro Ser Gly Ser Ser
            420                 425                 430

Thr Ile Ala Ala Gly Tyr Ser Gln Ser Gly Gly Val Thr Phe Gln Phe
        435                 440                 445

Asp Val Ser Gln Asn Asn Phe Met Gly Thr Gly Lys His Val Asn Ala
    450                 455                 460

Ser Phe Ser Arg Ser Glu Thr Arg Glu Val Tyr Ser Leu Gly Met Thr
465                 470                 475                 480

Asn Pro Tyr Phe Thr Val Asn Gly Val Ser Gln Ser Leu Ser Gly Tyr
                485                 490                 495

Tyr Arg Lys Thr Lys Tyr Asp Asn Lys Asn Ile Ser Asn Tyr Val Leu
            500                 505                 510

Asp Ser Tyr Gly Gly Ser Leu Ser Tyr Gly Tyr Pro Ile Asp Glu Asn
        515                 520                 525

Gln Arg Ile Ser Phe Gly Leu Asn Ala Asp Asn Thr Lys Leu His Gly
    530                 535                 540

Gly Arg Phe Met Gly Ile Ser Asn Val Lys Gln Leu Met Ala Asp Gly
545                 550                 555                 560

Gly Lys Ile Gln Val Asp Asn Asn Gly Ile Pro Asp Phe Lys His Asp
                565                 570                 575

Tyr Thr Thr Tyr Asn Ala Ile Leu Gly Trp Asn Tyr Ser Ser Leu Asp
            580                 585                 590

Arg Pro Val Phe Pro Thr Gln Gly Met Ser His Ser Val Asp Leu Thr
        595                 600                 605

Val Gly Phe Gly Asp Lys Thr His Gln Lys Val Val Tyr Gln Gly Asn
    610                 615                 620

Ile Tyr Arg Pro Phe Ile Lys Lys Ser Val Leu Arg Gly Tyr Ala Lys
625                 630                 635                 640

Leu Gly Tyr Gly Asn Asn Leu Pro Phe Tyr Glu Asn Phe Tyr Ala Gly
                645                 650                 655

Gly Tyr Gly Ser Val Arg Gly Tyr Asp Gln Ser Ser Leu Gly Pro Arg
            660                 665                 670

Ser Gln Ala Tyr Leu Thr Ala Arg Arg Gly Gln Gln Thr Thr Leu Gly
        675                 680                 685

Glu Val Val Gly Gly Asn Ala Leu Ala Thr Phe Gly Ser Glu Leu Ile
    690                 695                 700

Leu Pro Leu Pro Phe Lys Gly Asp Trp Ile Asp Gln Val Arg Pro Val
705                 710                 715                 720

Ile Phe Ile Glu Gly Gly Gln Val Phe Asp Thr Thr Gly Met Asp Lys
                725                 730                 735

Gln Thr Ile Asp Leu Thr Gln Phe Lys Asp Pro Gln Ala Thr Ala Glu
            740                 745                 750

Gln Asn Ala Lys Ala Ala Asn Arg Pro Leu Leu Thr Gln Asp Lys Gln
        755                 760                 765

Leu Arg Tyr Ser Ala Gly Val Gly Ala Thr Trp Tyr Thr Pro Ile Gly
    770                 775                 780

Pro Leu Ser Ile Ser Tyr Ala Lys Pro Leu Asn Lys Lys Gln Asn Asp
785                 790                 795                 800

Gln Thr Asp Thr Val Gln Phe Gln Ile Gly Ser Val Phe
                805                 810

<210> 3 

<211> 2442 

<212> DNA 

<213> Bacteria 
<400> 3 

atgcgtaatt catattttaa aggttttcag gtcagtgcaa tgacaatggc tgtcatgatg 60 

c-aacgccaa ctcatgcaca agcggcggat tttatggcaa acgacatcgc catcacagga 12C 

ccacagcgag tgaccattga aagcttacaa agcgtgctgr cgtttcgrtt ggatcaagtg 1SZ 

gcgagcgaag cacagttggc tgatggtgtc aaagcac::: acgcaacagg caactcucca 24 : 

gacgcgcaag cc:accatca agaagggcgt a:ca:cta:c aggtaaccga aaggccgtta 3 0: 

atcgcrtgaga ctaatcrcga gggcaatcgc ccaa:cccaa aagaaggtct acaagaaggg 25C 

ctaaaaaatg cccgcccagc tgtgggtcaa ccactaaaac aagccacag: acagatgatc 4 2 : 

gaaaccgag: traccaacca ataiatacca caaggcta:: ataataccga aattactgtc 4 3 j 
aaacagacga tgcttgatgg taatcgtgtt aagr:tgaca cgacc:ttgc tgaaggtaaa 

cccgcacggg tggttgatat caacatcat: ggcaatcagc attt tagcga tgcagatccg 60C 

atcga:g:gc cigcgattaa ggataanaaa accaatccac tgtctaaagc cgaccg:ca: 560 

actcaagaaa agccggtgac cagtttagag aa:tcgcg:g ccaaaca::: caacgcaggg 7 2 C 

1 1 tgtgcgt t c:gagar:aa agatgctaag cctaata::a atgaagataa aaaccgtatc 7SC 

tttgttgaga c:ccaccgca tgaaggtgag caacatcgc: ::ggacagac acagctmng 54 C 

ggtaatttaa c:tatactca agcagaactt gaggcactgc ttaaattcaa agcagaagaa 900 

gggttttcac aagccatgct tgagcaaaca acaaacaata :cagcaccaa atttggtgac 96C 

gatggctatt a:catgctca aatccgtcc: gcaacacgca ctaatgatga aagtcgtacg 1020 

gttgatgtgg aa:a:cacat tgaccctgta caccctgrcc atgtacgccg tattaattt: 10c0 

acaggtaact c:aagaccca agatgaagta ctecgtcgcg agacgcgaca acttgaaggt I14C 

gcgttggcat ctaatcaaaa aatccagccg cctcgtgcac gcttgatgcg gactgggttt 120C 

tttaaacatg ccaccgttga tactcgtcca gcacccaac: cacctgatca ggttgatgta 1260 

aattttgtgg ttgaagaaca accttcagga tcatcaacca tcgcagcagg ctactctcaa 132C 

agtggtggtg taacttttca atttgatgtt tctcaaaata actttatggg tacaggtaag 138C 

cacgtcaatg cttcgttttc tcgctccgag acccgtgagg tgtatagttt gggtatgacc 1440 

aacccatac: ttaccgtaaa tggcgtctcg caaagcttga gtggctacta tcgtaaaacc I50G 

aagcaugaca acaagaacat tagtaattat gtacctga;t cttatggtgg ctcatcaagc 1560 

tacggacacc caattgatga aaatcaacgc ataagctttg gtctgaatgc tgacaatacc 1620 

aagcttcatg gcggtcgttt tatgggcatt agtaatgtca agcagctgat ggcagatggt 1680 

ggcaaaattc aagtggataa taatggcatc cctgatttta agcatgatta cacaacctac 1740 

aatgccactt tggggtggaa ctattcaagt ccagatcgcc ctgtatttcc aacccaaggc 18C0 

acgagtcac: ctgcagatct gacggttggt tttggtgata aaactcatca aaaagtggtt 1860 

taccaaggca atatctatcg cccatttatc aaaaaatcag tcttgcgtgg atacgccaag 1920 

ttaggctatg gcaataattt accatrtcat gaaaatttct atgcaggcgg ctatggttcg 1980 

gttcgtggct atgatcaatc ctctttgggt ccacgctcac aagcctattt gacagctcgt 2040 

cgtggtcaac aaaccacact aggagaggtt gtcggtggta atgctttggc aactttcggc 210C 

agtgagctga ttttaccttt gccatctaaa ggtgattgga tagatcaggt gcgtccagcg 2160 

atattcattg agggcggtca ggtttttgat acaacaggta tggataaaca aaccattgat 2220 

ctaacccaat tuaaagaccc acaagcaaca gccgaacaaa acgcaaaagc agccaatcgc 2280 

ccgctaccaa cccaagacaa acagctgcgt tatagtgctg gtgttggtgc aacttggtat 2340 

acgcccatcg gtcctttatc cattagctat gccaagcca: tgaataaaaa acaaaacgat 2400 

cagaccgata cggtacagtt ccagattggt agtgtctttt aa 2442 

<210> 4 
<211> 813 
<212> PRT 
<213> Bacteria 

< 4 00 > 4 

Met Arg Asn Ser Tyr Phe Lys Gly Phe Gln Val Ser Ala Met Thr Met

1 5 10 15

Ala Val Met Met Val Met Ser Thr His Ala Gln Ala Ala Asp Phe Met

20 25 30

Ala Asn Asp Ile Ala Ile Thr Gly Leu Gln Arg Val Thr Ile Glu Ser

35 40 45

Leu Gln Ser Val Leu Pro Phe Arg Leu Gly Gln Val Val Ser Glu Ala
50 55 60
Gln Leu Ala Asp Gly Val Lys Ala Leu Tyr Ala Thr Gly Asn Phe Ser
65 70 75 80

Asp Val Gln Val Tyr His Gln Glu Gly Arg Ile Ile Tyr Gln Val Thr

85 90 95

Glu Arg Pro Leu Ile Ala Glu Ile Asn Phe Glu Gly Asn Arg Leu Ile

100 105 110

Pro Lys Glu Gly Leu Gln Glu Gly Leu Lys Asn Ala Gly Leu Ala Val

115 120 125

Gly Gln Pro Leu Lys Gln Ala Thr Val Gln Met Ile Glu Thr Glu Leu

130 135 140

Thr Asn Gln Tyr Ile Ser Gln Gly Tyr Tyr Asn Thr Glu Ile Thr Val
145 150 155 160

Lys Gln Thr Met Leu Asp Gly Asn Arg Val Lys Leu Asp Met Thr Phe

165 170 175

Ala Glu Gly Lys Pro Ala Arg Val Val Asp Ile Asn Ile Ile Gly Asn

180 185 190

Gln His Phe Ser Asp Ala Asp Leu Ile Asp Val Leu Ala Ile Lys Asp

195 200 205

Asn Lys Ile Asn Pro Leu Ser Lys Ala Asp Arg Tyr Thr Gln Glu Lys

210 215 220

Leu Val Thr Ser Leu Glu Asn Leu Arg Ala Lys Tyr Leu Asn Ala Gly
225 230 235 240

Phe Val Arg Phe Glu Ile Lys Asp Ala Lys Leu Asn Ile Asn Glu Asp

245 250 255

Lys Asn Arg Ile Phe Val Glu Ile Ser Leu His Glu Gly Glu Gln Tyr

260 265 270

Arg Phe Gly Gln Thr Gln Phe Leu Gly Asn Leu Thr Tyr Thr Gln Ala

275 280 285

Glu Leu Glu Ala Leu Leu Lys Phe Lys Ala Glu Glu Gly Phe Ser Gln

290 295 300

Ala Met Leu Glu Gln Thr Thr Asn Asn Ile Ser Thr Lys Phe Gly Asp
305 310 315 320

Asp Gly Tyr Tyr Tyr Ala Gln Ile Arg Pro Val Thr Arg Ile Asn Asp

325 330 335

Glu Ser Arg Thr Val Asp Val Glu Tyr Tyr Ile Asp Pro Val His Pro

340 345 350

Val Tyr Val Arg Arg Ile Asn Phe Thr Gly Asn Phe Lys Thr Gln Asp

355 360 365

Glu Val Leu Arg Arg Glu Met Arg Gln Leu Glu Gly Ala Leu Ala Ser

370 375 380

Asn Gln Lys Ile Gln Leu Ser Arg Ala Arg Leu Met Arg Thr Gly Phe
385 390 395 400

Phe Lys His Val Thr Val Asp Thr Arg Pro Val Pro Asn Ser Pro Asp

405 410 415

Gln Val Asp Val Asn Phe Val Val Glu Glu Gln Pro Ser Gly Ser Ser

420 425 430

Thr Ile Ala Ala Gly Tyr Ser Gln Ser Gly Gly Val Thr Phe Gln Phe

435 440 445

Asp Val Ser Gln Asn Asn Phe Met Gly Thr Gly Lys His Val Asn Ala

450 455 460

Ser Phe Ser Arg Ser Glu Thr Arg Glu Val Tyr Ser Leu Gly Met Thr
465 470 475 480

Asn Pro Tyr Phe Thr Val Asn Gly Val Ser Gln Ser Leu Ser Gly Tyr

485 490 495

Tyr Arg Lys Thr Lys Tyr Asp Asn Lys Asn Ile Ser Asn Tyr Val Leu

500 505 510

Asp Ser Tyr Gly Gly Ser Leu Ser Tyr Gly Tyr Pro Ile Asp Glu Asn
515 520 525
Gln Arg Ile Ser Phe Gly Leu Asn Ala Asp Asn Thr Lys Leu His Gly

530 535 540

Gly Arg Phe Met Gly Ile Ser Asn Val Lys Gln Leu Met Ala Asp Gly
545 550 555 560

Gly Lys Ile Gln Val Asp Asn Asn Gly Ile Pro Asp Phe Lys His Asp

565 570 575

Tyr Thr Thr Tyr Asn Ala Ile Leu Gly Trp Asn Tyr Ser Ser Leu Asp

580 585 590

Arg Pro Val Phe Pro Thr Gln Gly Met Ser His Ser Val Asp Leu Thr

595 600 605

Val Gly Phe Gly Asp Lys Thr His Gln Lys Val Val Tyr Gln Gly Asn

610 615 620

Ile Tyr Arg Pro Phe Ile Lys Lys Ser Val Leu Arg Gly Tyr Ala Lys
625 630 635 640

Leu Gly Tyr Gly Asn Asn Leu Pro Phe Tyr Glu Asn Phe Tyr Ala Gly

645 650 655

Gly Tyr Gly Ser Val Arg Gly Tyr Asp Gln Ser Ser Leu Gly Pro Arg

660 665 670

Ser Gln Ala Tyr Leu Thr Ala Arg Arg Gly Gln Gln Thr Thr Leu Gly

675 680 685

Glu Val Val Gly Gly Asn Ala Leu Ala Thr Phe Gly Ser Glu Leu Ile

690 695 700

Leu Pro Leu Pro Phe Lys Gly Asp Trp Ile Asn Gln Val Arg Pro Val
705 710 715 720

Ile Phe Ile Glu Gly Gly Gln Val Phe Asp Thr Thr Gly Met Asp Lys

725 730 735

Gln Thr Ile Asp Leu Thr Gln Phe Lys Asp Pro Gln Ala Thr Ala Glu

740 745 750

Gln Asn Ala Lys Ala Ala Asn Arg Pro Leu Leu Thr Gln Asp Lys Gln

755 760 765

Leu Arg Tyr Ser Ala Gly Val Gly Ala Thr Trp Tyr Thr Pro Ile Gly

770 775 780

Pro Leu Ser Ile Ser Tyr Ala Lys Pro Leu Asn Lys Lys Gln Asn Asp
785 790 795 800

Gln Thr Asp Thr Val Gln Phe Gln Ile Gly Ser Val Phe
805 810

<210> 5 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 5 
actatagggc acgcgtg 

<210> 6 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 6 
cctgcgtttg cttgattgag 



17 
<210> 7
<211> 61 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 7 



aagggcccaa ttacgcagac gagatccaca ggactacagc gagtgaccac tgaaagctta 60
c 61



<210> 8
<211> 67
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide
<400> 8 

aagggcccaa ttacgcagag ggtcgactta ttaaaagaca ctaccaatct ggaactatac 60
cgtatcg 67

<210> 9 

<211> 16 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Oligopeptide 

<400> 9 

Cys Tyr Ala Lys Pro Leu Asn Lys Lys Gln Asn Asp Gln Thr Asp Thr
1 5 10 15

<210> 10 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Oligopeptide 
<400> 10 

Tyr Leu Thr Ala Arg Arg Gly Gln Gln Thr Thr Leu Gly Glu Val Val

1 5 10 15

Cys
SEQUENCE INFORMATION

BASB027 Polynucleotide and Polypeptide Sequences 



SEQ ID NO:1

Moraxella catarrhalis BASB027 polynucleotide sequence from strain ATCC 43617 



ATGCGTAATTCATATTTTAAAGGTTTTCAGGTCAGTGCAATGACAATGGCTGTCATGATG 

GTAATGTCAACTCATGCACAAGCGGCGGATTTTATGGCAAATGACATTACCATCACAGGA 

CTACAGCGAGTGACCATTGAAAGCTTACAAAGCGTGCTGCCGTTTCGCTTGGGTCAAGTG 

GTGAGCGAAAACCAGTTGGCTGATGGTGTCAAAGCACTTTATGCAACAGGCAATTTTTCA 

GATGTGCAAGTCTATCATCAAGAAGGGCGTATCATCTATCAGGTAACCGAAAGGCCGTTA 

ATCGCTGAGATTAATTTTGAGGGCAATCGCTTAATTCCAAAAGAAGGTCTACAAGAAGGG 

CTAAAAAATGCTGGCTTAGCTGTGGGTCAACCACTAAAACAAGCCACAGTACAGATGATC

GAAACCGAGCTTACCAATCAATATATATCACAAGGCTATTATAATACCGAAATTACTGTC 

AAACAGACGATGCTTGATGGTAATCGTGTTAAGCTTGATATGACCTTTGCTGAAGGTAAA 

CCTGCACGGGTGGTTGATATTAATATCATTGGCAATCAGCATTTTAGCGATGCAGATTTG 

ATTGATGTGCTTGCGATTAAGGATAATAAAATCAATCCACTGTCTAAAGCTGACCGTTAT 

ACTCAAGAAAAGCTGGTGACCAGTTTAGAGAATTTGCGTGCTAAATATCTCAATGCAGGG 

TTTGTGCGTTTTGAGATTAAAGATGCTAAGCTTAATATTAATGAAGATAAAAACCGTATC 

TTTGTTGAGATTTCATTGCATGAAGGTGAGCAATATCGCTTTGGACAGACACAGTTTTTG 

GGTAATTTAACTTATACTCAAGCAGAACTTGAGGCACTGCTTAAATTCAAAGCAGAAGAA 

GGGTTTTCACAAGCCATGCTTGAGCAAACAACAAACAATATCAGTACCAAATTTGGTGAC 

GATGGCTATTATTATGCTCAAATCCGTCCTGTAACACGCATTAATGATGAAAGTCGTACG 

GTTGATGTGGAATATTATATTGACCCTGTACACCCTGTCTATGTACGCCGTATTAATTTT 

ACAGGTAACTTTAAGACCCAAGATGAAGTACTCCGTCGTGAGATGCGACAACTTGAAGGT 

GCGTTGGCATCTAATCAAAAAATCCAGCTGTCTCGTGCACGCTTGATGCGGACTGGGTTT

TTTAAACATGTTACCGTTGATACTCGTCCAGTACCCAACTCACCTGATCAGGTTGATGTA 

AATTTTGTGGTTGAAGAACAACCTTCAGGATCATCAACCATCGCAGCAGGCTACTCTCAA 

AGTGGTGGTGTAACTTTTCAATTTGATGTTTCTCAAAATAACTTTATGGGTACAGGTAAG 

CACGTCAATGCTTCGTTTTCTCGCTCTGAGACCCGTGAGGTGTATAGTTTGGGTATGACC 

AACCCATACTTTACCGTAAATGGCGTCTCGCAAAGCTTGAGTGGCTACTATCGTAAAACC 

AAGTATGATAACAAGAACATTAGTAATTATGTACTTGATTCTTATGGTGGCTCATTAAGC 

TATGGATATCCAATTGATGAAAATCAACGCATAAGCTTTGGTCTGAATGCTGACAATACC 

AAGCTTCATGGCGGTCGTTTTATGGGCATTAGTAATGTCAAGCAGCTGATGGCAGATGGT 

GGCAAAATTCAAGTGGATAATAATGGCATTCCTGATTTTAAGCATGATTACACAACCTAC 

AATGCCATTTTGGGGTGGAATTATTCAAGTCTAGATCGCCCTGTATTTCCAACCCAAGGC 

ATGAGTCATTCTGTAGATTTGACGGTTGGTTTTGGTGATAAAACTCATCAAAAAGTGGTT 

TATCAAGGCAATATCTATCGCCCATTTATCAAAAAATCAGTCTTGCGTGGATACGCCAAG 

TTAGGCTATGGCAATAATTTACCATTTTATGAAAATTTCTATGCAGGCGGCTATGGTTCG 

GTTCGTGGCTATGATCAATCCTCTTTGGGTCCACGCTCACAAGCCTATTTGACAGCTCGT 

CGTGGTCAACAAACCACACTAGGAGAGGTTGTTGGTGGTAATGCTTTGGCAACTTTCGGC 

AGTGAGCTGATTTTACCTTTGCCATTTAAAGGTGATTGGATAGATCAGGTGCGTCCAGTG 

ATATTCATTGAGGGCGGTCAGGTTTTTGATACAACAGGTATGGATAAACAAACCATTGAT 

TTAACCCAATTTAAAGACCCACAAGCAACAGCTGAACAAAATGCAAAAGCAGCCAATCGC 

CCGCTACTAACCCAAGATAAACAGTTGCGTTATAGTGCTGGTGTTGGTGCAACTTGGTAT 
ACGCCCATTGGTCCTTTATCTATTAGCTATGCCAAGCCATTGAATAAAAAACAAAATGAT
CAGACCGATACGGTACAGTTCCAGATTGGTAGTGTCTTTTAA 



SEQ ID NO:2 

Moraxella catarrhalis BASB027 polypeptide sequence deduced from the 
polynucleotide sequence of SEQ ID NO:1

MRNSYFKGFQVSAMTMAVMMVMSTHAQAADFMANDITITGLQRVTIESLQSVLPFRLGQV
VSENQLADGVKALYATGNFSDVQVYHQEGRIIYQVTERPLIAEINFEGNRLIPKEGLQEG
LKNAGLAVGQPLKQATVQMIETELTNQYISQGYYNTEITVKQTMLDGNRVKLDMTFAEGK
PARVVDINIIGNQHFSDADLIDVLAIKDNKINPLSKADRYTQEKLVTSLENLRAKYLNAG
FVRFEIKDAKLNINEDKNRIFVEISLHEGEQYRFGQTQFLGNLTYTQAELEALLKFKAEE
GFSQAMLEQTTNNISTKFGDDGYYYAQIRPVTRINDESRTVDVEYYIDPVHPVYVRRINF
TGNFKTQDEVLRREMRQLEGALASNQKIQLSRARLMRTGFFKHVTVDTRPVPNSPDQVDV
NFVVEEQPSGSSTIAAGYSQSGGVTFQFDVSQNNFMGTGKHVNASFSRSETREVYSLGMT
NPYFTVNGVSQSLSGYYRKTKYDNKNISNYVLDSYGGSLSYGYPIDENQRISFGLNADNT
KLHGGRFMGISNVKQLMADGGKIQVDNNGIPDFKHDYTTYNAILGWNYSSLDRPVFPTQG
MSHSVDLTVGFGDKTHQKVVYQGNIYRPFIKKSVLRGYAKLGYGNNLPFYENFYAGGYGS
VRGYDQSSLGPRSQAYLTARRGQQTTLGEVVGGNALATFGSELILPLPFKGDWIDQVRPV
IFIEGGQVFDTTGMDKQTIDLTQFKDPQATAEQNAKAANRPLLTQDKQLRYSAGVGATWY
TPIGPLSISYAKPLNKKQNDQTDTVQFQIGSVF
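
The polypeptide sequence is stated to be deduced from the polynucleotide sequence. As an illustration only, the following minimal Python sketch shows one way such a deduction could be checked, assuming the standard genetic code and reading frame 1 from the initial ATG. The function name `translate` and the table construction are illustrative assumptions and are not part of the original disclosure.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order:
# TTT, TTC, TTA, TTG, TCT, ... GGG. '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored and case is normalised; any codon that is not
    made of A/C/G/T is reported as 'X'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nucleotides of SEQ ID NO:1 as printed above.
first_line = "ATGCGTAATTCATATTTTAAAGGTTTTCAGGTCAGTGCAATGACAATGGCTGTCATGATG"
print(translate(first_line))  # MRNSYFKGFQVSAMTMAVMM
```

Applied to the full 2442-nucleotide sequence, the same routine would be expected to return the 813-residue polypeptide, the terminal TAA codon being read as a stop.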



SEQ ID NO:3 

Moraxella catarrhalis BASB027 polynucleotide sequence from strain ATCC 43617 



atgcgtaattcatattttaaaggttttcaggtcagtgcaatgacaatggctgtcatgatg 
gtaatgtcaactcatgcacaagcggcggattttatggcaaatgacattaccatcacagga 
ctacagcgagtgaccattgaaagcttacaaagcgtgctgccgtttcgcttgggtcaagtg 
gtgagcgaaaaccagttggctgatggtgtcaaagcactttatgcaacaggcaatttttca 
gatgtgcaagtctatcatcaagaagggcgtatcatctatcaggtaaccgaaaggccgtta 
atcgctgagattaattttgagggcaatcgcttaattccaaaagaaggtctacaagaaggg 
ctaaaaaatgctggcttagctgtgggtcaaccactaaaacaagccacagtacagatgatc 
gaaaccgagcttaccaatcaatatatatcacaaggctattataataccgaaattactgtc 
aaacagacgatgcttgatggtaatcgtgttaagcttgatatgacctttgctgaaggtaaa 
cctgcacgggtggttgatattaatatcattggcaatcagcattttagcgatgcagatttg 
attgatgtgcttgcgattaaggataataaaatcaatccactgtctaaagctgaccgttat 
actcaagaaaagctggtgaccagtttagagaatttgcgtgctaaatatctcaatgcaggg 
tttgtgcgttttgagattaaagatgctaagcttaatattaatgaagataaaaaccgtatc 
tttgttgagatttcattgcatgaaggtgagcaatatcgctttggacagacacagtttttg 
ggtaatttaacttatactcaagcagaacttgaggcactgcttaaattcaaagcagaagaa 
GGGTTTTCACAAGCCATGCTTGAGCAAACAACAAACAATATCAGTACCAAATTTGGTGAC
GATGGCTATTATTATGCTCAAATCCGTCCTGTAACACGCATTAATGATGAAAGTCGTACG 
GTTGATGTGGAATATTATATTGACCCTGTACACCCTGTGTATGTACGCCGTATTAATTTT
ACAGGTAACTTTAAGACCCAAGATGAAGTACTCCGTCGTGAGATGCGACAACTTGAAGGT 
GCGTTGGCATCTAATCAAAAAATCCAGCTGTCTCGTGCACGCTTGATGCGGACTGGGTTT 
TTTAAACATGTTACCGTTGATACTCGTCCAGTACCCAACTCACCTGATCAGGTTGATGTA 
AATTTTGTGGTTGAAGAACAACCTTCAGGATCATCAACCATCGCAGCAGGCTACTCTCAA
AGTGGTGGTGTAACTTTTCAATTTGATGTTTCTCAAAATAACTTTATGGGTACAGGTAAG 
CACGTCAATGCTTCGTTTTCTCGCTCTGAGACCCGTGAGGTGTATAGTTTGGGTATGACC 
AACCCATACTTTACCGTAAATGGCGTCTCGCAAAGCTTGAGTGGCTACTATCGTAAAACC 
AAGTATGATAACAAGAACATTAGTAATTATGTACTTGATTCTTATGGTGGCTCATTAAGC 
TATGGATATCCAATTGATGAAAATCAACGCATAAGCTTTGGTCTGAATGCTGACAATACC 
AAGCTTCATGGCGGTCGTTTTATGGGCATTAGTAATGTCAAGCAGCTGATGGCAGATGGT 
GGCAAAATTCAAGTGGATAATAATGGCATTCCTGATTTTAAGCATGATTACACAACCTAC
AATGCCATTTTGGGGTGGAATTATTCAAGTCTAGATCGCCCTGTATTTCCAACCCAAGGC 
ATGAGTCATTCTGTAGATTTGACGGTTGGTTTTGGTGATAAAACTCATCAAAAAGTGGTT 
TATCAAGGCAATATCTATCGCCCATTTATCAAAAAATCAGTCTTGCGTGGATACGCCAAG 
TTAGGCTATGGCAATAATTTACCATTTTATGAAAATTTCTATGCAGGCGGCTATGGTTCG 
GTTCGTGGCTATGATCAATCCTCTTTGGGTCCACGCTCACAAGCCTATTTGACAGCTCGT 
CGTGGTCAACAAACCACACTAGGAGAGGTTGTTGGTGGTAATGCTTTGGCAACTTTCGGC 
AGTGAGCTGATTTTACCTTTGCCATTTAAAGGTGATTGGATAGATCAGGTGCGTCCAGTG 
ATATTCATTGAGGGCGGTCAGGTTTTTGATACAACAGGTATGGATAAACAAACCATTGAT 
TTAACCCAATTTAAAGACCCACAAGCAACAGCTGAACAAAATGCAAAAGCAGCCAATCGC 
CCGCTACTAACCCAAGATAAACAGTTGCGTTATAGTGCTGGTGTTGGTGCAACTTGGTAT 
ACGCCCATTGGTCCTTTATCTATTAGCTATGCCAAGCCATTGAATAAAAAACAAAATGAT 
CAGACCGATACGGTACAGTTCCAGATTGGTAGTGTCTTTTAA 

SEQ ID NO:4 

Moraxella catarrhalis BASB027 polypeptide sequence deduced from the 
polynucleotide sequence of SEQ ID NO:3 

MRNSYFKGFQVSAMTMAVMMVMSTHAQAADFMANDITITGLQRVTIESLQSVLPFRLGQV
VSENQLADGVKALYATGNFSDVQVYHQEGRIIYQVTERPLIAEINFEGNRLIPKEGLQEG
LKNAGLAVGQPLKQATVQMIETELTNQYISQGYYNTEITVKQTMLDGNRVKLDMTFAEGK
PARVVDINIIGNQHFSDADLIDVLAIKDNKINPLSKADRYTQEKLVTSLENLRAKYLNAG
FVRFEIKDAKLNINEDKNRIFVEISLHEGEQYRFGQTQFLGNLTYTQAELEALLKFKAEE
GFSQAMLEQTTNNISTKFGDDGYYYAQIRPVTRINDESRTVDVEYYIDPVHPVYVRRINF
TGNFKTQDEVLRREMRQLEGALASNQKIQLSRARLMRTGFFKHVTVDTRPVPNSPDQVDV
NFVVEEQPSGSSTIAAGYSQSGGVTFQFDVSQNNFMGTGKHVNASFSRSETREVYSLGMT
NPYFTVNGVSQSLSGYYRKTKYDNKNISNYVLDSYGGSLSYGYPIDENQRISFGLNADNT
KLHGGRFMGISNVKQLMADGGKIQVDNNGIPDFKHDYTTYNAILGWNYSSLDRPVFPTQG
MSHSVDLTVGFGDKTHQKVVYQGNIYRPFIKKSVLRGYAKLGYGNNLPFYENFYAGGYGS
VRGYDQSSLGPRSQAYLTARRGQQTTLGEVVGGNALATFGSELILPLPFKGDWIDQVRPV
IFIEGGQVFDTTGMDKQTIDLTQFKDPQATAEQNAKAANRPLLTQDKQLRYSAGVGATWY



WO 99/63093 



PCT/EP99/0382 



TPIGPLSISYAKPLNKKQNDQTDTVQFQIGSVF 
SEQ ID NO: 5

ACT ATA GGG CAC GCG TG 

SEQ ID NO: 6 

CCT GCG TTT GTT TGA TTG AG 

SEQ ID NO: 7 

AAG GGC CCA ATT ACG CAG AGG GGA TCC ACA GGA CTA CAG CGA GTG
ACC ATT GAA AGC TTA C

SEQ ID NO: 8 

AAG GGC CCA ATT ACG CAG AGG GTC GAC TTA TTA AAA GAC ACT ACC
AAT CTG GAA CTG TAC CGT ATC G

SEQ ID NO: 9 

CYAKPLNKKQNDQTDT 
SEQ ID NO: 10 

YLTARRGQQTTLGEVVC
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